Greetings,
We have Notifier configured to send emails to our NetCool server (and others) for ticketing purposes whenever a device goes down. Yesterday afternoon, we noticed that we were not receiving emails or tickets for devices going down. When I looked in our NOTIFIER.OUT log, the last event that was received seemed to go through only halfway and was then followed by this text:
EventMessage: Thu 23 Mar, 2017 - 16:28:12 [Event 58d42fdc-1726-109c-0180-800005001bc9 is unavailable from Archive Manager: Response not received in time ]
Only displaying most recent of 3 event messages.
When I tried to stop Archive Manager from SCP on our Primary server, it did not appear to respond; that is, the option to stop Archive Manager remained grayed out after I clicked it.
When I eventually killed the Archive Manager process on our Primary Server, our Secondary server took over processing events. When I restarted the Archive Manager process on our Primary server, it did not immediately take over processing events. We left for the day assuming that event processing would remain on our Secondary server. At some point during the evening, the Primary returned to processing events and emails were being sent as expected.
Has anyone else experienced this behavior since moving to 10.2? We have had other issues with Archive Manager between our Primary and Secondary servers, although we have applied all the recommended patches to address them. If anyone from CA is reading, does this point to Archive Manager slowing down or hanging? Is there a way to troubleshoot this if it occurs again?
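In the meantime, one stopgap we are considering is a small script that scans NOTIFIER.OUT for the timeout message quoted above, so we at least find out sooner. This is only a rough sketch: the pattern is modeled on the single log line in this post, the function name find_archive_timeouts is made up for illustration, and the log path has to be passed in because it depends on the install. It is not a CA-provided tool.

```python
import re
import sys

# Pattern modeled on the one NOTIFIER.OUT line quoted in this post.
# Assumption: other messages of this kind follow the same layout.
TIMEOUT_RE = re.compile(
    r"EventMessage:\s+"
    r"(?P<ts>\w{3} \d{1,2} \w{3}, \d{4} - \d{2}:\d{2}:\d{2})\s+"
    r"\[Event (?P<event_id>[0-9a-f-]+) is unavailable from Archive Manager: "
    r"(?P<reason>[^\]]*?)\s*\]"
)


def find_archive_timeouts(lines):
    """Return (timestamp, event_id, reason) for each matching log line.

    The timestamp is kept as the raw string from the log so that no
    locale or time zone assumptions are made.
    """
    hits = []
    for line in lines:
        match = TIMEOUT_RE.search(line)
        if match:
            hits.append(
                (match.group("ts"), match.group("event_id"), match.group("reason"))
            )
    return hits


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_notifier.py /path/to/NOTIFIER.OUT
    with open(sys.argv[1], errors="replace") as log_file:
        for ts, event_id, reason in find_archive_timeouts(log_file):
            print(f"{ts}  event {event_id}: {reason}")
```

Run from cron or a scheduled task, any output would be a hint that Archive Manager is not answering Notifier in time. It would not tell us why, which is still the part I am hoping someone can help with.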
Thanks,
Michael