DX Unified Infrastructure Management


 How to monitor an HA failover situation?

Luc Christiaens posted Mar 22, 2023 06:11 AM

Suppose you run an HA-protected main hub plus 2 proxy servers with tunnels to all the hubs, each protected by an HA probe.

When HA is implemented (as in this document: https://community.broadcom.com/enterprisesoftware/communities/community-home/digestviewer/viewthread?GroupId=1315&MessageKey=f28d1578-1ac9-45db-903d-b46ccbd5041c), you can find yourself running in failover mode without noticing it.

Question: What would be the best callbacks, database fields, and so on to create a report, QoS, or dashboard that gives a good overview of your HA status?

I would like to create an automated script that gives such an overview, but the get_status callback on the HA probe alone does not make clear what HA is doing.

Any suggestions, ideas?

Marius Nitu

Hi Luc,

As you are much more qualified than most of us, I don't think I can give you a useful idea, but I can use the opportunity to learn something.

I'm not sure what you actually need, but I have a few ideas that I would use:

  • Process the HA alarms; the last one should indicate what HA is doing, assuming the HA probe is running, which you can find out with the get_status callback;
  • Monitor ha.log with logmon, similar to the option above;
  • Find out which data_engine probe is running;
  • Check whether the primary hub is available.
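
As a rough illustration of how those four checks could be combined into one status line for a report or dashboard, here is a minimal Python sketch. Everything in it is an assumption for illustration only: the HaSignals fields, the state names, and the decision order are hypothetical and are not part of UIM. How each signal is collected (get_status on the HA probe, the alarm history, a hub connectivity check, and so on) is left to your own script.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class HaSignals:
    """Hypothetical container for the four checks listed above."""
    ha_probe_running: bool               # e.g. derived from get_status on the HA probe
    primary_hub_reachable: bool          # e.g. derived from a hub connectivity check
    data_engine_hub: Optional[str]       # "primary", "secondary", or None if unknown
    last_ha_alarm: Optional[str] = None  # text of the most recent HA alarm, if any


def classify(signals: HaSignals) -> str:
    """Map the collected signals to a coarse, made-up HA state name."""
    if not signals.ha_probe_running:
        # Without the HA probe nothing else can be trusted.
        return "HA_PROBE_DOWN"
    if not signals.primary_hub_reachable:
        return "FAILOVER"
    if signals.data_engine_hub == "secondary":
        # Primary is reachable but data_engine still runs on the secondary.
        return "INCONSISTENT"
    if signals.data_engine_hub == "primary":
        return "NORMAL"
    return "UNKNOWN"


def summarize(signals: HaSignals) -> str:
    """Build a one-line summary suitable for a report or an alarm text."""
    alarm = signals.last_ha_alarm or "none"
    return f"HA state: {classify(signals)} (last HA alarm: {alarm})"


if __name__ == "__main__":
    example = HaSignals(
        ha_probe_running=True,
        primary_hub_reachable=False,
        data_engine_hub="secondary",
    )
    print(summarize(example))
```

The HA probe check comes first because the other signals only say something about failover while that probe is running. The "INCONSISTENT" state is just one way to flag a primary hub that is back while data_engine still runs on the secondary.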