Hi,
I am trying to do a compute and svMotion of a multi TB VM (FTT1) from one vSAN cluster to another over 1Gbit uplink.
After several hours (around 10 hours and was more than 50%) it fails with Event: Migration to host xx.xx.xx.xx failed with error Connection closed by remote host, possibly due to timeout (195887167)
Source vSAN is 6.2U2 and destination vSAN is 6.7U2
vmkping -I does not show any ping loss from source to destination vmk when I test RTT min=0173 avg=0.460 max=1.218
Both cluster/hosts connected to the same switch
VM is not busy but still powered on, does it help if I try again with the VM powerd off?
I selected "Schedule vMotion with high priority (recommended) - not sure if instead I select "Schedule regular vMotion" would help?
I am thinking this might be saturation the uplink during the migration and timing out, could that be the case and would it not lower the transfer speed if so?
Any specific logs I should check or any other methods I can try to migrate?
I tested a migration (compute and svMotion) of a very small VM from the same cluster (same source host) to the same destination host in other cluster and migration worked fine, so think it must be failing because of its size.
On source host vmkernel I see events related to:
S: failed to read stream keepalive: Connection reset by peer
S: Migration considered a failure by the VMX. It is most likely a timeout...
Destroying Device for world xxxxxxxx
Destroying Device for world xxxxxxxx
disabled port xxxxxxxxxxxx
XVMotion: 2479 Timout out while waiting for disk 2's queue count to drop below the minimum limit of 32768 blocks. This could indicate network or storage problems...
Thanks