DX NetOps

  • 1.  Spectrum - DEVICE HAS STOPPED RESPONDING TO POLLS

    Posted Dec 10, 2019 11:44 AM
    In our environment these alerts fire nonstop, and we have an integration with SNOW for auto-ticketing.
    This causes an issue because by the time an engineer goes to check, the alert has already cleared.
    We had to turn off auto-ticketing for this particular alert, which isn't ideal.
    I tried creating an event rule to address the issue, but I was advised that it could cause irregular behavior in Spectrum.
    The DCM timeout only goes up to 1 minute, so that isn't helpful. Polling is every 5 minutes by default, and we can't push that much further.
    Has anyone seen this issue and addressed it?

    ------------------------------
    Thanks

    Sam Johnson
    ------------------------------


  • 2.  RE: Spectrum - DEVICE HAS STOPPED RESPONDING TO POLLS
    Best Answer

    Broadcom Employee
    Posted Dec 10, 2019 05:49 PM
    Sam,

    I am not familiar with the SNOW integration, but if it uses AlarmNotifier, you can use SANM to age the alarm before sending it to AlarmNotifier.

    If you can configure which alarms to send to SNOW, you can ADD an event pair rule to the 0x10d35 event that looks for the 0x10d30 event to be generated within nnn seconds. If the 0x10d30 event is not generated within nnn seconds, generate a new custom critical event and send that one to SNOW. Your new event/alarm could be "Device lost contact for more than 5 minutes".

    Make sure the 0x10d30 event clears your custom alarm.

    You can ADD to the out of the box configuration of the 0x10d35 event as long as you do not change the out of the box behavior. 

    I did the following in the lab (I set the timer to 60 seconds, not 300, for testing):

    0xfff00034 E 0 A 3,0xfff00034,N
    0x00010d30 E 50 C 0x00010003 C 0x00010004 C 0x00010007 C 0x00010009 C 0x003d0000 C 0xfff00034
    0x00010d35 E 75 A 3,0x00010009,N R CA.EventPair, 0x10d30, "0xfff00034 -:-", 60

    Here are the results:

    The out of the box configuration of the 0x10d30 and 0x10d35 events has not changed, so there will be no impact on fault isolation.
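
    For readers new to event pair rules, the timing logic described above can be sketched in Python. This is only a simulation of the idea for a single device, not Spectrum code: the `Event` class, the `apply_event_pair` function, and the `window` parameter are hypothetical names invented for illustration. The event codes are the ones from the lab configuration.

```python
from dataclasses import dataclass

LOST = 0x10D35        # device has stopped responding to polls
RESTORED = 0x10D30    # contact with the device re-established
CUSTOM = 0xFFF00034   # custom event from the lab configuration


@dataclass(frozen=True)
class Event:
    code: int
    time: float  # seconds since some arbitrary start


def apply_event_pair(events, window=60.0):
    """Return the custom events that would be raised for one device.

    A custom event is raised for each LOST event that is NOT followed
    by a RESTORED event within `window` seconds (hypothetical model).
    """
    ordered = sorted(events, key=lambda e: e.time)
    raised = []
    for i, ev in enumerate(ordered):
        if ev.code != LOST:
            continue
        deadline = ev.time + window
        # Look for a matching RESTORED event before the deadline.
        paired = any(
            later.code == RESTORED and later.time <= deadline
            for later in ordered[i + 1:]
        )
        if not paired:
            raised.append(Event(CUSTOM, deadline))
    return raised
```

    With a 60 second window, a device that recovers after 30 seconds raises nothing, while one that recovers after 90 seconds raises the custom event at the 60 second mark. Only that custom event would be forwarded to ticketing.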

    Joe



  • 3.  RE: Spectrum - DEVICE HAS STOPPED RESPONDING TO POLLS

    Posted Dec 13, 2019 11:28 AM
    Hi, sorry for the late reply. 
    I tried modifying 0x10d35 and it impacted fault isolation in my environment.
    I'm not sure if I configured it correctly.

    ------------------------------
    Thanks

    Sam Johnson
    ------------------------------



  • 4.  RE: Spectrum - DEVICE HAS STOPPED RESPONDING TO POLLS

    Posted Dec 11, 2019 02:17 AM
    We also generate SNOW tickets for critical alarms and had the same issue. We now use a separate notifier with a delay of 10 minutes, so alarms that clear within those 10 minutes don't generate a ticket.
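
    The effect of such a delay can be sketched as follows. This is a rough Python illustration of the filtering logic only, not a notifier configuration: the function name `tickets_to_open`, the tuple layout, and the `delay` parameter are assumptions made for the example.

```python
def tickets_to_open(alarms, delay=600.0):
    """Return ids of alarms that should produce a ticket.

    `alarms` is a list of (alarm_id, raised_at, cleared_at) tuples,
    with times in seconds and cleared_at set to None while the alarm
    is still active. An alarm that clears before `delay` seconds have
    passed is dropped (hypothetical model of a delayed notifier).
    """
    result = []
    for alarm_id, raised_at, cleared_at in alarms:
        still_active = cleared_at is None
        if still_active or cleared_at - raised_at >= delay:
            result.append(alarm_id)
    return result


# Example: "a" clears after 2 minutes, "b" never clears,
# "c" clears after more than 10 minutes.
alarms = [("a", 0, 120), ("b", 0, None), ("c", 100, 800)]
to_ticket = tickets_to_open(alarms)
```

    Here only "b" and "c" would reach ticketing, since "a" cleared inside the 10 minute delay.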


  • 5.  RE: Spectrum - DEVICE HAS STOPPED RESPONDING TO POLLS

    Posted Dec 13, 2019 11:29 AM
    How did you go about creating the notifier?
    I haven't had much exposure to SANM.

    ------------------------------
    Thanks

    Sam Johnson
    ------------------------------