Thanks for your help so far.
But in my case the VMs are also not responding; that is how I noticed that the main ESXi server was having issues.
This morning the same issue occurred again: the system was still running, but the containers were unavailable and the ESXi host no longer responded, not even to direct access via keyboard and display on the device. The display still shows the grayed-out screen, but keyboard input no longer works.
I rebooted the device and executed the command you mentioned:
/etc/init.d/sfcbd-watchdog stop
Besides this, I found the VMware KB article "Issuing a 0x85 SCSI command from a VMware ESXi 6.0 host with the EMC XtremIO storage array may result in a PDL error (2133286)", because I also saw this message in the log: "cpu3:524635)NMP: nmp_ResetDeviceLogThrottling:3782: last error status from device mpx.vmhba32:C0:T0:L0 repeated 1 times". So I also executed:
/etc/init.d/smartd stop
chkconfig smartd off
Because I'm running ESXi from a USB drive, I doubt that this device is still fully working. Perhaps it has issues that are causing ESXi to crash. If the system freezes again, I will install ESXi on another USB device to check whether that fixes the problem. I suspect the drive because I see these errors in "vmkernel.log":
2021-11-26T20:26:22.652Z cpu1:524312)NMP: nmp_ThrottleLogForDevice:3856: Cmd 0x28 (0x453a411d66c0, 526185) to dev "mpx.vmhba32:C0:T0:L0" on path "vmhba32:C0:T0:L0" Failed:
2021-11-26T20:26:22.652Z cpu1:524312)NMP: nmp_ThrottleLogForDevice:3865: H:0x0 D:0x2 P:0x0 Valid sense data: 0x3 0x11 0x0. Act:NONE. cmdId.initiator=0x43034bcaccc0 CmdSN 0xc
2021-11-26T20:26:22.652Z cpu1:524312)ScsiDeviceIO: 4062: Cmd(0x453a411d66c0) 0x28, CmdSN 0xc from world 526185 to dev "mpx.vmhba32:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x3 0x11 0x0.
2021-11-26T20:26:22.652Z cpu2:526651)Vol3: 1982: Couldn't read volume header from 60749f38-e295605f-dd74-78f29e90b189: I/O error
2021-11-26T20:26:22.653Z cpu2:526651)Vol3: 4226: Failed to get object 28 type 1 uuid 60749f39-70241bc6-d9cc-78f29e90b189 FD 0 gen 0 :I/O error
2021-11-26T20:26:22.653Z cpu2:526651)WARNING: Fil3: 1518: Failed to reserve volume f533 28 1 60749f39 70241bc6 f278d9cc 89b1909e 0 0 0 0 0 0 0
2021-11-26T20:26:22.653Z cpu2:526651)Vol3: 4226: Failed to get object 28 type 2 uuid 60749f39-70241bc6-d9cc-78f29e90b189 FD 4 gen 1 :I/O error
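For what it's worth, the status fields in these lines can be decoded with the standard SCSI code tables. The Python sketch below is only an illustration: the function name `decode_scsi_failure` and the small lookup tables are my own and cover just a handful of codes, and the regular expression is written against the ScsiDeviceIO line format shown above. Under those assumptions, opcode 0x28 is READ(10), D:0x2 is CHECK CONDITION, and sense data 0x3 0x11 0x0 is MEDIUM ERROR with an unrecovered read error, which would fit a failing USB drive.

```python
import re

# Partial lookup tables (values from the SCSI standards); only a few
# codes are listed here, anything else is reported as "unknown".
OPCODES = {0x28: "READ(10)", 0x2A: "WRITE(10)", 0x85: "ATA PASS-THROUGH(16)"}
DEVICE_STATUS = {0x0: "GOOD", 0x2: "CHECK CONDITION", 0x8: "BUSY"}
SENSE_KEYS = {
    0x2: "NOT READY",
    0x3: "MEDIUM ERROR",
    0x4: "HARDWARE ERROR",
    0x5: "ILLEGAL REQUEST",
}
ASC_ASCQ = {(0x11, 0x00): "UNRECOVERED READ ERROR"}

# Assumed line layout, taken from the ScsiDeviceIO entry in the log above.
PATTERN = re.compile(
    r'Cmd\(0x[0-9a-f]+\) (?P<op>0x[0-9a-f]+),.*?'
    r'to dev "(?P<dev>[^"]+)" failed '
    r'H:(?P<h>0x[0-9a-f]+) D:(?P<d>0x[0-9a-f]+) P:(?P<p>0x[0-9a-f]+) '
    r'Valid sense data: (?P<key>0x[0-9a-f]+) (?P<asc>0x[0-9a-f]+) (?P<ascq>0x[0-9a-f]+)'
)


def decode_scsi_failure(line):
    """Return a dict describing a failed SCSI command, or None if no match."""
    m = PATTERN.search(line)
    if m is None:
        return None
    op = int(m.group("op"), 16)
    status = int(m.group("d"), 16)
    key = int(m.group("key"), 16)
    asc = int(m.group("asc"), 16)
    ascq = int(m.group("ascq"), 16)
    return {
        "device": m.group("dev"),
        "opcode": OPCODES.get(op, "unknown"),
        "device_status": DEVICE_STATUS.get(status, "unknown"),
        "sense_key": SENSE_KEYS.get(key, "unknown"),
        "additional_sense": ASC_ASCQ.get((asc, ascq), "unknown"),
    }


if __name__ == "__main__":
    sample = (
        '2021-11-26T20:26:22.652Z cpu1:524312)ScsiDeviceIO: 4062: '
        'Cmd(0x453a411d66c0) 0x28, CmdSN 0xc from world 526185 to dev '
        '"mpx.vmhba32:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 '
        'Valid sense data: 0x3 0x11 0x0.'
    )
    print(decode_scsi_failure(sample))
```

The tables are deliberately small; codes that are not listed simply come back as "unknown" rather than being guessed.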
I used the command below to check which drives have which names:
esxcli storage core path list
And the errors are related to the USB drive, as far as I can see:
usb.vmhba32-usb.0:0-mpx.vmhba32:C0:T0:L0
   UID: usb.vmhba32-usb.0:0-mpx.vmhba32:C0:T0:L0
   Runtime Name: vmhba32:C0:T0:L0
   Device: mpx.vmhba32:C0:T0:L0
   Device Display Name: Local USB Direct-Access (mpx.vmhba32:C0:T0:L0)
   Adapter: vmhba32
   Channel: 0
   Target: 0
   LUN: 0
   Plugin: NMP
   State: active
   Transport: usb
   Adapter Identifier: usb.vmhba32
   Target Identifier: usb.0:0
   Adapter Transport Details: Unavailable or path is unclaimed
   Target Transport Details: Unavailable or path is unclaimed
   Maximum IO Size: 32768

pcie.100-pcie.0:0-eui.0000000001000000e4d25c5331cb5201
   UID: pcie.100-pcie.0:0-eui.0000000001000000e4d25c5331cb5201
   Runtime Name: vmhba1:C0:T0:L0
   Device: eui.0000000001000000e4d25c5331cb5201
   Device Display Name: Local NVMe Disk (eui.0000000001000000e4d25c5331cb5201)
   Adapter: vmhba1
   Channel: 0
   Target: 0
   LUN: 0
   Plugin: HPP
   State: active
   Transport: pcie
   Adapter Identifier: pcie.100
   Target Identifier: pcie.0:0
   Adapter Transport Details: Unavailable or path is unclaimed
   Target Transport Details: Unavailable or path is unclaimed
   Maximum IO Size: 131072
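To match the device name from the log against this listing without reading it by eye, something like the following Python sketch could be used. It is only an illustration: `parse_path_list` and `transport_of` are hypothetical helper names of my own, and the parser assumes the "Key: Value" layout shown above, with each record starting on a line that contains no ": " separator.

```python
def parse_path_list(text):
    """Split 'esxcli storage core path list' style text into dicts.

    Assumption: a line without ': ' starts a new record (the path name),
    and every other non-empty line is a 'Key: Value' field of that record.
    """
    records = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        key, sep, value = line.partition(": ")
        if sep:
            if records:  # ignore stray fields before the first record
                records[-1][key] = value
        else:
            records.append({"Path": line})
    return records


def transport_of(records, device):
    """Return the Transport field of the record for this device, or None."""
    for rec in records:
        if rec.get("Device") == device:
            return rec.get("Transport")
    return None


if __name__ == "__main__":
    # Shortened sample taken from the output above.
    sample = """\
usb.vmhba32-usb.0:0-mpx.vmhba32:C0:T0:L0
   Device: mpx.vmhba32:C0:T0:L0
   Transport: usb

pcie.100-pcie.0:0-eui.0000000001000000e4d25c5331cb5201
   Device: eui.0000000001000000e4d25c5331cb5201
   Transport: pcie
"""
    recs = parse_path_list(sample)
    print(transport_of(recs, "mpx.vmhba32:C0:T0:L0"))
```

With the listing above, the device named in the vmkernel.log errors (mpx.vmhba32:C0:T0:L0) maps to the usb transport, while the NVMe disk sits on pcie.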
I will get back to you when I know more.