Is any one have script to check the rorot and probe communication error issue.
i.e. when i see in nimsoft console all robot is showing green it means its running fine but if we open an any robot probe then we get the communication error.
we have around 1000+ servers and its not possible to me to check the status manually.
I use a custom LUA script driven by nas that does "port_list" callback to controller on every robot of every hub (that are not in maint mode) and checks that hdb, controller and spooler have ports (excluding hub robot).
I suggest you write a similar script.
I'm also writing a custom probe to does this and other health check functions.
Thanks for your reply can you please share that LUA script wich you have created for health check.
Unfortunately I can't in this case
Can you share the things you check for?
I'm developing my own dashboard to keep an eye on certain things happening in Nimbus like queues/subscribers Nimbus infrastructure etc