ESXi

 View Only
  • 1.  Crash: Spin Count exceeded (domain0) - possible deadlock with PCPU 2

    Posted May 16, 2012 03:30 AM

    Hi All. This is my first message on the community forum. I have ran into the PSOD today runing ESXi 5.0.0 - build 702118 x86_64.

    I have tried ESXi 4.1.0 build 348481 x86_64 before and run into ?similar? and ran into deadlock: "Spin Count exceeded (coalesceLock) - possible deadlock with PCPU 1

    I have a core and collected the information form vSphere accourding to the KB provided by vmware. I am running evaluation version and doing this not for business. I have found that in order to submit a crash or any technicl support ticket I need to have a contract. I was planing to let evaluation expire and get back to the limited version of the vmware that offered for free. Crash I do not like and would like to submit it.

    Did I get it right that I will not be able to submit one unless I have a support contract? Sounds weird, because from vmware stand point it is almost a "free testing". 

    Could that be a hardware issue after all? Looks like I am getting different PCPU number failing every time. Yes, I am running a whitelisted hardware. Looking for advice.

    Thank you all!



  • 2.  RE: Crash: Spin Count exceeded (domain0) - possible deadlock with PCPU 2

    Posted May 16, 2012 05:55 AM

    Hmmm.. from the first stab on the PSOD, I am able to interpret that the e1000 nic in your server might be having an interrupt sharing with another device on your server and the other device [I am just taking a wild guess], might be your SCSI controller hosting the disks.

    In your BIOS, as per your vendor, can you try changing the interrupt allocation of the e1000 nic so that no same interrupts are being shared for this device.

    Also, the reason, I am saying is that vmkernel will mostly run on PCPU0 only and it is having a deadlock with another driver who is not releasing the interrupt while vmkernel has passed an interrupt to be acknowledged. this is as much as i can grab from your PSOD.



  • 3.  RE: Crash: Spin Count exceeded (domain0) - possible deadlock with PCPU 2

    Posted May 16, 2012 10:49 PM

    Thank you for the reply. I have taken PCI NIC network card and left the other PCI-X NIC inplace. I promisse to report if that indeed helped. It ussually happens once a week....

    I have found some other Linux related issues in this area: https://bugzilla.kernel.org/show_bug.cgi?id=38632

    I have ASUS P8H67M PRO that is described in that article. I hope that info helps somebody too.

    Thanks again!



  • 4.  RE: Crash: Spin Count exceeded (domain0) - possible deadlock with PCPU 2

    Posted Jun 01, 2012 02:19 PM

    After removing the NIC it is still going strong :smileyhappy:



  • 5.  RE: Crash: Spin Count exceeded (domain0) - possible deadlock with PCPU 2
    Best Answer

    Posted Jun 01, 2012 02:50 PM

    So it is resolved are is it still happening



  • 6.  RE: Crash: Spin Count exceeded (domain0) - possible deadlock with PCPU 2

    Posted Jun 02, 2012 10:31 PM

    looks like resolved, thanks for your help