Hi all, I have spent quite a lot of time thinking about how vSAN handles certain failure scenarios. I have one scenario that I have had difficulty figuring out. In the example below, I have a stretched cluster vSAN setup, each AZ located in a dedicated physical site.
But in this setup, I am also doing a multi-rack architecture. Where the workload domains are spanned across physically separated racks. The design goal is to have a setup that could withstand a single rack failure (entire rack), or a complete site failure (2 entire racks within a site).
But what happens exact when we loose a single rack? If lets say, I am using a RAID-6 SPBM Policy (Dual Site Mirroring). If I was to loose an entire rack, I suspect that VMs with LOCAL affinity (no site mirroring) would HA restart on the surviving rack in that site. But VMs with SITE MIRRORING would restart on the other site.
The confusion comes from the fact that vSAN Stretched clusters define the datacenters as the fault domain, and are not "rack aware". But I am trying to figure out what can happen in this scenario. Its my understanding that you can use a multi-rack design, unless this is not fully supported.
Is this correct? / any thoughts?