VMware vSphere

 View Only
  • 1.  Kernel Panic VMWare 8.0.3 Build 24091160 with HPE ProLiant DL385 Gen11 Bios: A1.32

    Posted Aug 19, 2024 08:02 AM
      |   view attached

    Hello Community,

    we have Problems with an updated VMWare vSphere from 8.0 to 8.0.3. The Server will go down each 2-4 Days with Kernel Error. The details are displayed in the attachment.  Today we find no solution to fix this problem.

    Have you an Idea to fix that problem?

    Greetings,

    Thomas

    Attachment(s)

    pdf
    1445_VGVMware.pdf   92 KB 1 version


  • 2.  RE: Kernel Panic VMWare 8.0.3 Build 24091160 with HPE ProLiant DL385 Gen11 Bios: A1.32

    Posted Aug 20, 2024 01:06 AM

    Firstly, do you have a Broadcom support contract to have the logs investigated in detail?

    Normally when this happens, although it may have multiple sources, the main culprits are firmware version and driver version, both of which you should source from HP.

    Check your servers have the latest firmware and check the current firmware is compatible with the version of ESXi.

    Check your servers have the latest drivers and check the current drivers are compatible with the version of ESXi.

    Check if you have hardware faults logged on your HP via iLO.

    Ensure you have the latest ESXi patches applied and if not, please patch the servers.

    Also, check if you have a power saving feature on your server via the iLO. If it does then please disable for now.

    Does this happen for all hosts or just one?

    Are all hosts on the same patch level?




  • 3.  RE: Kernel Panic VMWare 8.0.3 Build 24091160 with HPE ProLiant DL385 Gen11 Bios: A1.32

    Posted Aug 20, 2024 02:21 AM

    Hello JDMils,

    Thank your for your message and your help.

    No I can't contract the Broadcom Support because they haven't transfer the licences to their portal. This was my first step to contact Broadcom to check that problem.

    The ESXI Image was load by HPE Image for this server. The iLO Logs was sent to HPE Support, they can't find everything that goes wrong,

    but in another Thread in HPE Forum they talk about the same problem with AMD EPYC with iLO and kernel panic.

    So today I have to look, that the Bios and the iLO come to actual state and then we will see.

    I'll inform you when the problem is solved.

    Thank you,

    Thomas




  • 4.  RE: Kernel Panic VMWare 8.0.3 Build 24091160 with HPE ProLiant DL385 Gen11 Bios: A1.32

    Posted Aug 29, 2024 04:19 AM
    Edited by Frank Hausser Aug 29, 2024 04:20 AM

    Hello Forum,

    we were abled to fix that problem, cause it's a driver fault from HPE or VMWare. We uninstalled the ilo driver in VMWare.

    Until December 2023 will be this problem with the ilo Driver and this will make an PSOD and the server froze.

    We hope that HPE or Broadcoam will fix that problem promtly in the future.



    ------------------------------
    Brendle Thomas
    ------------------------------



  • 5.  RE: Kernel Panic VMWare 8.0.3 Build 24091160 with HPE ProLiant DL385 Gen11 Bios: A1.32

    Posted Aug 30, 2024 09:44 AM

    Hi Thomas,

    Thanx for Sharing...

    We also had PSODs on DL380's Gen11. This was a bug in iLO 1.60 in combination with the NS204 NVMe Boot devices.

    HPE had us downgrade this version to 1.58 and the stability returned. I am now upgrading BIOS and iLO's again to the latest version (of which 4 pieces were released last 3 months... )

    Henry