vCenter version: 7.0.3.00800 Build: 20150588
k8s: v1.22.9+vmware.1 VMware Photon OS/Linux 4.19.225-3.ph3 containerd://1.5.11
Cluster size: Control plane: 3 best-effort-2xlarge, Workers: 10 best-effort-2xlarge
When I start deploying applications like elasticsearch-rally, cassandra, fio, vdbench, pgbench most of the nodes come under disk pressure evicting the pods.
I see following events on the nodes:
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Warning FreeDiskSpaceFailed 41m kubelet failed to garbage collect required amount of images. Wanted to free 729588531 bytes, but freed 0 bytes
Warning FreeDiskSpaceFailed 26m kubelet failed to garbage collect required amount of images. Wanted to free 687059763 bytes, but freed 0 bytes
Warning ImageGCFailed 21m kubelet failed to garbage collect required amount of images. Wanted to free 703636275 bytes, but freed 0 bytes
Warning FreeDiskSpaceFailed 21m kubelet failed to garbage collect required amount of images. Wanted to free 703636275 bytes, but freed 0 bytes
Warning FreeDiskSpaceFailed 16m kubelet failed to garbage collect required amount of images. Wanted to free 703996723 bytes, but freed 0 bytes
Warning ImageGCFailed 16m kubelet failed to garbage collect required amount of images. Wanted to free 703996723 bytes, but freed 0 bytes
Normal NodeHasDiskPressure 12m (x9 over 15h) kubelet Node tkgs-cluster-1-test-nodes-wtzl5-8d6d65695-2n2pp status is now: NodeHasDiskPressure
Warning FreeDiskSpaceFailed 11m kubelet failed to garbage collect required amount of images. Wanted to free 3352056627 bytes, but freed 0 bytes
Warning ImageGCFailed 11m kubelet failed to garbage collect required amount of images. Wanted to free 3352056627 bytes, but freed 0 bytes
Warning EvictionThresholdMet 7m41s (x29 over 15h) kubelet Attempting to reclaim ephemeral-storage
By default the root partition disk size is 16 GB. Is there any way to deploy the vSphere with Tanzu (TKGs) cluster with larger root partition.
I am able to reproduce the issue consistently in last 3 releases of vSphere with Tanzu including the recent one.