Hi,
we got a bunch of new DL380Gen10 that are directly connected to IBM V3700 storage (vSphere 6.7 U3 + latest updates, HPE image). The servers are equipped with StoreFabric SN1200E adapter (OEM of Broadcom LPe31000-M6). First we ran into a problem where no LUN's were visible, I could find out that this was due to a driver version before lpfc 12.8.542.25. We are now using FW 12.8.542.32 and lpfc driver 12.8.614.2 (from Broadcom downloads, as HPE told us to use upstream versions, not HPE SPP etc version). Now LUN's / datastores are accessible.
But I still get a PSOD when rebooting/shutting down the server. There is something about lpfc driver in the trace, but as this happens at the very end of a reboot, no dump is written to local scratch location or netdump. Sometimes the server only hangs with "Shutting down device drivers...." forever. If either of the two happens monitoring is sending out an alert that one or both adapter ports are down (NicAllLinksDown Event from ILO). This does not happen during reboot when there is no PSOD. I tested different firmware / driver versions, it seems to make no difference. It also happend with no storage connected at all - disconnected FC cables.
My VMware case was closed after some time now, as VMware is pointing at HPE as the manufacturer. HPE is pointing at IBM and IBM's SSIC matrix (which contains no Proliant server for V3700 storage + vSphere). As well as Emulex as the manufacturer of the adapter.
Does anyone have an idea from looking at the PSOD what the problems might be? Any lpfc driver parameter to try out?
