Hi there,
I have a Dell PowerEdge 2950 (running ESX4 Classic with all latest patches) connecting to an MD3000i that intermittently drops its connection to its storage, but after a moment regains connection. I know this because it sends out the alarms that it has lost connection, but when I go to check the vCenter, I notice that there is no problem other than the errors in the event log.
Further investigation in /var/log shows the following:
vmkiscsid.log
2009-08-26-13:03:02: iscsid: Nop-out timedout after 10 seconds on connection 3:0 state (3). Dropping session.
2009-08-26-13:03:06: iscsid: connection3:0 is operational after recovery (2 attempts)
vmkernel:
Aug 26 13:03:02 esx-beta vmkernel: 1:21:23:40.354 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StopConnection: vmhba33:CH:4 T:0 CN:0: iSCSI connection is being marked "OFFLINE"
Aug 26 13:03:02 esx-beta vmkernel: 1:21:23:40.354 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StopConnection: Sess
Aug 26 13:03:02 esx-beta vmkernel: 1:21:23:40.354 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StopConnection: Conn
Aug 26 13:03:05 esx-beta vmkernel: 1:21:23:42.748 cpu2:4098)NMP: nmp_CompleteCommandForPath: Command 0x12 (0x410002019e40) to NMP device "mpx.vmhba1:C0:T0:L0" failed on physical path "vmhba1:C0:T0:L0" H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x24 0x0.
Aug 26 13:03:05 esx-beta vmkernel: 1:21:23:42.748 cpu2:4098)ScsiDeviceIO: 747: Command 0x12 to device "mpx.vmhba1:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x24 0x0.
Aug 26 13:03:06 esx-beta vmkernel: 1:21:23:43.712 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StartConnection: vmhba33:CH:4 T:0 CN:0: iSCSI connection is being marked "ONLINE"
Aug 26 13:03:06 esx-beta vmkernel: 1:21:23:43.712 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StartConnection: Sess
Aug 26 13:03:06 esx-beta vmkernel: 1:21:23:43.712 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StartConnection: Conn
Aug 26 13:03:08 esx-beta vmkernel: 1:21:23:45.920 cpu3:4099)NMP: nmp_CompleteCommandForPath: Command 0x12 (0x41000205fe80) to NMP device "mpx.vmhba1:C0:T0:L0" failed on physical path "vmhba1:C0:T0:L0" H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x24 0x0.
Aug 26 13:03:08 esx-beta vmkernel: 1:21:23:45.920 cpu3:4099)ScsiDeviceIO: 747: Command 0x12 to device "mpx.vmhba1:C0:T0:L0" failed H:0x0 D:0x2 P:0x0 Valid sense data: 0x5 0x24 0x0.
vmkwarning:
Aug 26 13:03:02 esx-beta vmkernel: 1:21:23:40.354 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StopConnection: vmhba33:CH:4 T:0 CN:0: iSCSI connection is being marked "OFFLINE"
Aug 26 13:03:02 esx-beta vmkernel: 1:21:23:40.354 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StopConnection: Sess
Aug 26 13:03:02 esx-beta vmkernel: 1:21:23:40.354 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StopConnection: Conn
Aug 26 13:03:06 esx-beta vmkernel: 1:21:23:43.712 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StartConnection: vmhba33:CH:4 T:0 CN:0: iSCSI connection is being marked "ONLINE"
Aug 26 13:03:06 esx-beta vmkernel: 1:21:23:43.712 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StartConnection: Sess
Aug 26 13:03:06 esx-beta vmkernel: 1:21:23:43.712 cpu3:4238)WARNING: iscsi_vmk: iscsivmk_StartConnection: Conn
Effectively, from what I can gather, it looses connection and regains connection 4 seconds later. Problem is I don't know why its doing this? Is there any way I can get further information regarding these drop outs? I don't think its the MD3000i since we currently have two other PE 2950's configured identically to this problem PE 2950 that are connected to the same MD3000i, that are displaying absolutely no faults at the same time that this machine shows an error. My configuration is identical to how is shown here: and I have jumbo frames enabled.
Thanks in advance for your help.
Gene