Ive updated an Article but its not published yet so here is the content...
How to monitor a server or UIM robot that seems to be in a hung state
The term "hung state" is too vague. This could mean many different things, all with different possible solutions:
For example, symptoms may include but are not limited to the following:
- A single application on the system is not responding
- You cannot log in remotely with RDP, yet all other functions are working.
- All web services are down but other services are working, e.g., print services
- UIM robot is installed for monitoring, but the controller.log shows that nothing is being written to the log
- Disk space is very low or disk/drive or file system may be full
In some cases, an application is in a bad state but the OS is still functioning and responds to ping requests.
Note that there is no single UIM probe that can handle all of these scenarios and send an alarm for them.
First, you will need to define what constitutes a 'hung state' and then see what can be done to monitor for that condition.
You may choose one or more options to monitor a server/robot/system that appears to be hung and not functional.
net_connect probe
If the application or the server itself is not 'responding,' you can monitor it with the net_connect probe to test availability by monitoring any given service that is expected to be running, and using its default port, e.g., RDP port 3389, WMI port 135, SSH port 22 (Linux/Unix), WebSphere app port 9443, etc.
If a robot is installed, and the UIM robot is in a hung state, the spooler which listens on port 48001 will not be responsive, so you can use the net_connect probe on a hub to check the robot on its default port 48001, and send an alert if the service is unresponsive.
sql_response (or sqlserver) probe
If an application should be inserting data into a database, possibly create an SQL query or checkpoint to check for records being added and alarm if no new records/rows are being added.
dirscan
If the application writes to a log file, set up a remote dirscan to monitor the log files for activity.
e2e_appmon probe
If the application or server is not 'responding,' monitor it with the e2e_appmon probe using a **** script
Once you clearly define what a 'hung state' is in your case and how you can ost effectively check for and alert on this problem, then a possible solution may be able to be found.
Ideally, best practice is to implement proactive monitoring and be warned BEFORE the server becomes hung due to one or more factors mentioned above.
Best Regards,
Steve
------------------------------
Support Engineer
Broadcom
US
------------------------------
Original Message:
Sent: Dec 21, 2023 11:59 PM
From: Nagaraju Settipalli
Subject: How to set alert or email for Down servers in UIM Console
Hi @Stephen Danseglio,
Thanks for looking ...
- When the robot is down we want an alerts, Suppose if the server is not reachable means ping alert will generate, but if the robot is running on the server and server is reachable, but server is showing red colour in our console for that we need email or INC to be create.
- can you suggest me what is best scenario for hung then we can set an alert. there are n number of scenarios if the server is going to be hung. do you have any possible scenario to create an alerts for hung.
- The INC triggered and after 10 min the INC has been auto resolved, but i have to pause atleast 10 min to auto resolved, how can we edit this. is there any key need to create in Raw config of sdgtw probe?
Please share your update on this.
Thanks
Raj.
Original Message:
Sent: Dec 21, 2023 01:05 PM
From: Stephen Danseglio
Subject: How to set alert or email for Down servers in UIM Console
Hi Raj,
Please provide a little more detail. Do you mean a hub/robot is not running, or a server is not reachable, server is hung, or something else?
Is there a particular alarm(s) youre receiving and you would lke to send an email for the server 'down' issue?
Thanks,
Steve
------------------------------
Support Engineer
Broadcom
US
Original Message:
Sent: Dec 21, 2023 12:48 AM
From: Nagaraju Settipalli
Subject: How to set alert or email for Down servers in UIM Console
Hi Community Team,
Can you please help on, how to set an email alert or Inc alert for Down servers in UIM console.
- suppose I have 10 servers in UIM, out of 10 servers, 5 servers are in red colour. for 5 servers we need to set email alerts. how to set the alert for down servers.
Thanks you
Raj.