Okay guys, here's the deal with my situation and this error.
Basically, we have an XIV backend provisioning volumes to an IBM n6040 (NetApp) which are seen on the netapp as an aggregate volume. This aggregate is carved into volumes on the NetApp and then Luns are created inside these volumes and presented to our ESXi environment. Now presented to the VMware HBA's, they are turned into datastores with the VM's placed thereon.
Okay, this error is a result of a SCSI timeout issue. If you look at your logs you'll probably see timeout errors. Typically, SCSI can come back with some "sense" data which can usually give a hint as to what the issue is.
In our case, we weren't getting any data. We'd just see the datastores timeout. The HBA target doesn't drop, no paths fail, it just times out and doesn't respond.
Again, in OUR case, it turned out to be the NetApp waiting on a storage shelf to process commands for our Exchance environment. Why you ask? Because on the NetApp everything has to traverse this interconnect which, duh, connects the two modules and makes it a cluster. Now, when our new Exchange environment started receiving a lot of data it pushed a lot of reads and writes onto this interconnect. Although the fiber-connected luns to the ESXi environment weren't exactly alinged on the netapp (part of the problem), this error would occur mostly because when the NetApp couldn't respond to the VMware luns because it was waiting on the ESX3000 storage unit to fill it's interconnect queue.
You see, we bought EXN3000 storage system (disk tray shelf for the netapp which connects via SAS) and it's disks only run at 7200 rpm. We are pushing wayyyyyy to many read and write ops onto these disk for them to keep up.
As we started loading data onto the exchange disks it then started to affect our ESXi environment.
The problem was always there but didn't manifest itself.
Our solution?
Move the VMware environment to straight XIV until we can better load balance the Exchange disk issues.
If anyone wants more details just PM me.
I have a nice set of commands that can help you diagnose this.