DX Unified Infrastructure Management

 View Only
  • 1.  Routine maintenance

    Posted Jul 29, 2020 05:19 PM
    ​Hello Guys,
    I am new to UIM and still learning, Thanks to all of your valuable suggestions.

    I am looking for
    1.Things that we need to check daily as part of the UIM daily maintenance and support.
    2.What need to check when an robot or hub probe is updated with new hotfix or version.

    Thanks,

    ------------------------------
    Thanks
    ------------------------------


  • 2.  RE: Routine maintenance
    Best Answer

    Posted Jul 29, 2020 05:25 PM
    think the idea is to have it setup for self monitoring so all you need to do is just check for new alarms. 

    Best Practices for Monitoring CA UIM (self-health monitoring)
    Article Id: 9640
    https://ca-broadcom.wolkenservicedesk.com/external/article?articleId=9640

    ------------------------------
    Support Engineer
    Broadcom
    ------------------------------



  • 3.  RE: Routine maintenance

    Broadcom Employee
    Posted Jul 29, 2020 05:33 PM

    For #2 you can look at following on a given robot:

    C:\Program Files (x86)\Nimsoft\robot\pkg\versions.txt




  • 4.  RE: Routine maintenance

    Broadcom Employee
    Posted Jul 29, 2020 05:41 PM
    Here are a few more items to be aware of:

    CA Unified Infrastructure Management Hotfix Index
    https://support.broadcom.com/external/content/release-announcements/CA-Unified-Infrastructure-Management-Hotfix-Index/7233

    Sign up for Proactive Notifications for Unified Infrastructure Management
    https://support.broadcom.com/user/notifications.html

    - Critical Alerts
    - Product Advisories
    - Release Announcements
    - Security Advisories

    Steve

    ------------------------------
    Support Engineer
    Broadcom
    US
    ------------------------------



  • 5.  RE: Routine maintenance

    Posted Jul 30, 2020 01:12 AM
    Every morning we check: 
    - Is there any queuing on all my hubs (can be done with: https://community.broadcom.com/enterprisesoftware/viewdocument/queuecheck-lua-script-v22?CommunityKey=170eb4e5-a593-4af2-ad1d-f7655e31513b&tab=librarydocuments)
    - Is there enough disk space on all my UIM servers (done via cdm probe)
    - check sql response time and alert when > xx ms (done via small sql query in sql_response probe)
    - check if # daily alerts is not > xxx (where xxx is your normal number + 20%)  So you can trap an unseen alarm loop/storm (via sql_response)
    - check the # of qos entries and create alarm if > xxx (this would warn you if a custom probe or snmpcollector has some wrong setup) (via sql_response)
    - check if my daily data_engine maintenance is running fine (via logmon probe on data_engine.log)
    Most of these checks are done via Listviews with some colored markers to indicate possible problems 


    ​​