Failover Cluster Errors
Live Migration Failure & Event ID 21502 – Hyper-V Cluster
Setup Overview:**
I have a Microsoft Server 2019-based Failover Cluster with three nodes (SRV1, SRV2, SRV3). The infrastructure was set up four years ago, and recently, we have been encountering Live Migration failures across certain nodes.
All servers are connected to shared storage and operate on the same LAN using a Cisco switch. The cluster hosts approximately 12 to 15 virtual machines running both Linux and Windows Server.
Issue 1 – Event ID 21502 & Virtual Machine Migration Errors**
We are experiencing Live Migration failures with the following error messages:
Errors on SRV2:
Live migration of 'Virtual Machine Ad-Sync' failed. Virtual machine migration operation for 'Ad-Sync' failed at migration destination 'SRV3'. (Virtual machine ID 776248B9-3683-457E-A90D-4914A9A4BF1B).
Live migration of 'Virtual Machine WIN-ADMIN-CENTER' failed. Virtual machine migration operation for 'WIN-ADMIN-CENTER' failed at migration destination 'SRV3'. (Virtual machine ID 6D5D28B5-F06C-494D-BC58-CD73290E9041).
Errors on SRV1:
Live migration of 'Virtual Machine HRMS-PRODUCTION' failed. Virtual machine migration operation for 'HRMS-PRODUCTION' failed at migration destination 'SRV3'. (Virtual machine ID F7474C27-CF9C-4B62-8975-E0FEFD9C8D8B).
Issue 2 – Live Migration Status:
- Failing Migrations:
- SRV1 → SRV3
- SRV2 → SRV3
- Successful Migrations:
- SRV3 → SRV1
- SRV3 → SRV2
This suggests a possible issue with authentication, network settings, or cluster misconfiguration.
Observations & Configuration Details:
- Authentication Settings
- CredSSP authentication is enabled across all nodes:
- Cluster Shared Volume (CSV) State
- CSVs are online and assigned to respective nodes:
- Network & Connectivity Checks
- All servers can ping each other via the V-MIGRATION network.
- NIC configurations for Live Migration:
- All servers are running Windows Server 2019 Standard (Build 17763, Version 1809).
- Hyper-V host settings confirm that Live Migration is enabled.
Request for Assistance
- Could these migration failures be related to authentication settings? Should I consider switching to Kerberos authentication instead of CredSSP?
- Are there any specific configurations I should check to resolve Live Migration failures and prevent Event ID 21502 errors?
- Any recommendations for troubleshooting and resolving this issue would be greatly appreciated.