DX Unified Infrastructure Management


  • 1.  How to set alert or email for Down servers in UIM Console

    Posted Dec 21, 2023 12:49 AM

    Hi Community Team,

    Can you please help with setting up an email alert or incident (INC) alert for down servers in the UIM console?

    1. Suppose I have 10 servers in UIM and 5 of them are shown in red. We need email alerts for those 5 servers. How do we set the alert for down servers?

    Thank you,

    Raj.



  • 2.  RE: How to set alert or email for Down servers in UIM Console

    Broadcom Employee
    Posted Dec 21, 2023 01:06 PM

    Hi Raj,

    Please provide a little more detail. Do you mean a hub/robot is not running, or a server is not reachable, server is hung, or something else?

    Is there a particular alarm (or alarms) you're receiving for which you would like to send an email about the server 'down' issue?

    Thanks,

    Steve



    ------------------------------
    Support Engineer
    Broadcom
    US
    ------------------------------



  • 3.  RE: How to set alert or email for Down servers in UIM Console

    Posted Dec 21, 2023 11:59 PM

    Hi @Stephen Danseglio,

    Thanks for looking ...

    1. We want an alert when the robot is down. If the server is not reachable, a ping alert is generated. But if the robot is running and the server is reachable, yet the server still shows red in our console, we need an email or INC to be created.
    2. Can you suggest the best scenario for detecting a hung server so that we can set an alert? A server can hang in any number of ways. Do you have any scenarios we could use to create alerts for a hung server?
    3. The INC was triggered and was auto-resolved after 10 minutes, but I need a delay of at least 10 minutes before it is auto-resolved. How can we edit this? Is there a key that needs to be created in the Raw config of the sdgtw probe?

    Please share your update on this.

    Thanks 

    Raj.




  • 4.  RE: How to set alert or email for Down servers in UIM Console

    Broadcom Employee
    Posted Dec 22, 2023 12:58 PM
    I've updated an article, but it's not published yet, so here is the content:
    How to monitor a server or UIM robot that seems to be in a hung state
     
    The term "hung state" is too vague. This could mean many different things, all with different possible solutions:
     
    For example, symptoms may include but are not limited to the following:
     
    - A single application on the system is not responding
    - You cannot log in remotely with RDP, yet all other functions are working.
    - All web services are down but other services are working, e.g., print services
    - UIM robot is installed for monitoring, but the controller.log shows that nothing is being written to the log
    - Disk space is very low or disk/drive or file system may be full
     
    In some cases, an application is in a bad state but the OS is still functioning and responds to ping requests.
     
    Note that there is no single UIM probe that can handle all of these scenarios and send an alarm for them.
     
    First, you will need to define what constitutes a 'hung state' and then see what can be done to monitor for that condition.
     
    You may choose one or more options to monitor a server/robot/system that appears to be hung and not functional.
     
    net_connect probe
    If the application or the server itself is not 'responding,' you can monitor it with the net_connect probe to test availability by monitoring any given service that is expected to be running, and using its default port, e.g., RDP port 3389, WMI port 135, SSH port 22 (Linux/Unix), WebSphere app port 9443, etc.
     
    If a robot is installed, and the UIM robot is in a hung state, the spooler which listens on port 48001 will not be responsive, so you can use the net_connect probe on a hub to check the robot on its default port 48001, and send an alert if the service is unresponsive.
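
    As a rough illustration of the kind of TCP port check described above, here is a minimal Python sketch. It is not the net_connect probe and does not use any UIM API; it only shows the underlying idea of attempting a connection to a port and treating a failure as "unresponsive". The function names and the default timeout are assumptions made for this example.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures
        return False


def unresponsive_ports(host: str, ports, timeout: float = 3.0):
    """Return the subset of ports on host that did not accept a connection."""
    return [p for p in ports if not port_is_open(host, p, timeout)]
```

    For example, unresponsive_ports("myserver", [3389, 48001]) would return the ports that did not answer, where "myserver" is a placeholder host name. A successful connection only shows that something is listening on the port, not that the service behind it is healthy.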
     
    sql_response (or sqlserver) probe
    If an application should be inserting data into a database, possibly create an SQL query or checkpoint to check for records being added and alarm if no new records/rows are being added.
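
    The "no new rows" check can be sketched in Python as follows. This is not the sql_response or sqlserver probe configuration. It uses the standard sqlite3 module purely so the example is self-contained, and it assumes a table with a timestamp column stored as ISO-8601 UTC text. The table name, column name, and function names are hypothetical.

```python
import sqlite3
from datetime import datetime, timedelta, timezone


def rows_added_since(conn, table, ts_column, since):
    """Count rows whose timestamp column is at or after 'since'."""
    # Identifiers are interpolated directly, so pass only trusted names here.
    query = f"SELECT COUNT(*) FROM {table} WHERE {ts_column} >= ?"
    (count,) = conn.execute(query, (since.isoformat(),)).fetchone()
    return count


def is_stalled(conn, table, ts_column, window, now=None):
    """Return True if no rows were added within the last 'window'."""
    if now is None:
        now = datetime.now(timezone.utc)
    return rows_added_since(conn, table, ts_column, now - window) == 0
```

    A scheduled job could call is_stalled(conn, "app_events", "created_at", timedelta(minutes=10)) and raise an alarm when it returns True. The right window depends on how often the application normally writes.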
     
    dirscan
    If the application writes to a log file, set up a remote dirscan to monitor the log files for activity.
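
    The idea behind watching log files for activity can be sketched in Python as below. This is not how dirscan is configured; it only shows the underlying check, which is to compare a file's last-modified time against a maximum allowed age. The function names and the threshold are assumptions for this example.

```python
import os
import time


def seconds_since_modified(path, now=None):
    """Return how many seconds ago the file at path was last modified."""
    if now is None:
        now = time.time()
    return now - os.path.getmtime(path)


def stale_files(paths, max_age_seconds, now=None):
    """Return the paths that have not been modified within max_age_seconds."""
    return [p for p in paths if seconds_since_modified(p, now) > max_age_seconds]
```

    If an application normally writes to its log every few minutes, a log that has not changed for, say, 15 minutes (max_age_seconds=900) is a reasonable signal that the application may be hung.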
     
    e2e_appmon probe
    If the application or server is not 'responding,' monitor it with the e2e_appmon probe using a **** script
     
    Once you clearly define what a 'hung state' means in your case and how you can most effectively check for and alert on it, a solution can likely be found.
    Ideally, the best practice is to implement proactive monitoring so that you are warned BEFORE the server becomes hung due to one or more of the factors mentioned above.

    Best Regards,
    Steve


    ------------------------------
    Support Engineer
    Broadcom
    US
    ------------------------------



  • 5.  RE: How to set alert or email for Down servers in UIM Console

    Posted Dec 27, 2023 04:02 AM

    Hi @Stephen Danseglio

    Thanks for your support.

    Thanks,

    Raj




  • 6.  RE: How to set alert or email for Down servers in UIM Console

    Posted Dec 27, 2023 11:04 AM

    Hi All,

    Good day,

    Please let me know the differences between MPN HSMA-C191-T0000 and MPN HSMA-C191-TU000.

    Which is the correct MPN?

    Looking forward to your valuable support.

    Regards,

    Manoj Balarkrishnan.