It's been a long time since I ran v6.7 but I seem to recall that your issue may have been a software bug.
You should be able to restart the host management agents and get them re-connected without having to reboot.
Paul Boserup
Senior Server Engineer
Information Systems - Technical Services
Sarasota Memorial Hospital
603-276-5329 mobile
*************************************************************************
Confidentiality Notice: the information contained in this email and any attachments may be legally privileged and confidential. If you are not an intended recipient, you are hereby notified that any dissemination, distribution, or copying of this e-mail is strictly prohibited. If you have received this e-mail in error, please notify the sender and permanently delete the e-mail and any attachments immediately. You should not retain, copy or use this e-mail or any attachments for any purpose, nor disclose all or any part of the contents to any other person.
Original Message:
Sent: 2/4/2025 12:00:00 PM
From: John D
Subject: ESXi Server Randomly Becomes "Not Responding" and VM Disconnection
I have a cluster of 8 ESXi 6.7 (14320388) servers, Dell R640. Occasionally, random servers go into a "not responding" status, and the virtual machines on them become "disconnected," although the virtual machines on the problematic server continue to run.
In the /var/log/hostd.log
file, there are many lines like this:
d[2595562] [Originator@6876 sub=IoTracker] In thread 2100290, access("/vmfs/volumes/642fde55-b53efb8c-836f-908d6ec63b42/catalog") took over 15503 sec.
d[2595562] [Originator@6876 sub=IoTracker] In thread 2100474, access("/vmfs/volumes/642fde55-b53efb8c-836f-908d6ec63b42/catalog") took over 12372 sec.
This is one of the Dell ME5084 datastores with HDD disks, and there are no alerts in vCenter indicating any errors. I cannot log in through the ESXi web interface because it times out. After entering the password in DCUI, it takes 7-10 minutes to log in. Additionally, when executing any list commands via SSH, the console hangs.
I have been able to resolve this issue by restarting the ESXi server, but I would like to know if there is a way to solve this problem without rebooting the host.