Failover Hyper-V Cluster errors

Anonymous
2025-02-17T07:36:14+00:00

Setup and Infrastructure

I have three Microsoft Server 2019 based Failover Cluster having three nodes HP-SRV1, HP-SRV2 and HP-SRV3

This infrastructure was created 4 Years ago from past few month we are getting error and facing "Live Migration Machine" error / issue.

These all servers are on HP Server with Storage and all on same LAN connectivity with Cisco switch.

We have approx. 12 to 15 Virtual Machine on Linux, Windows Server.

Issue 1 - Error on Virtual Machine

"The Operation did not complete on resource Virtual Machine HMRS-PRODUCTION. Select the "Information Details" action to view events

Facing same error on three to four virtual machine, it looks like that subjected Virtual machine trying to move on different server.

Please note that subject Virtual machine are on Separate server like one machine is on HP-SRV1 and one is on HP-SRV2 but same error occurred.

Issue 2 - Live Migration

Live Migration failed from HP-SRV1 to HP-SRV3, HP-SRV2 to HP-SRV3 however i found that HP-SRV3 to HP-SRV1 successfully moved also HP-SRV3 to HP-SRV2 successfully moved.,

Troubleshooting

Found 21502 on HP-SRV1 and HP-SRV2 but not found on HP-SRV3

Error on HP-SRV2

Live migration of 'Virtual Machine Ad-Sync' failed.

Virtual machine migration operation for 'Ad-Sync' failed at migration destination 'HP-SRV3'. (Virtual machine ID 776248B9-3683-457E-A90D-4914A9A4BF1B

Live migration of 'Virtual Machine WIN-ADMIN-CENTER' failed.

Virtual machine migration operation for 'WIN-ADMIN-CENTER' failed at migration destination 'HP-SRV3'. (Virtual machine ID 6D5D28B5-F06C-494D-BC58-CD73290E9041)

Error on HP-SRV1

Live migration of 'Virtual Machine HRMS-PRODUCTION' failed.

Virtual machine migration operation for 'HRMS-PRODUCTION' failed at migration destination 'HP-SRV3'. (Virtual machine ID F7474C27-CF9C-4B62-8975-E0FEFD9C8D8B)

Log Name: System

Source: Microsoft-Windows-Hyper-V-High-Availability

Date: 16/02/2025 10:37:43 PM

Event ID: 21502

Task Category: None

Level: Error

Keywords:

User: SYSTEM

Computer: HP-SRV1.csaplho.pk

Description:

Live migration of 'Virtual Machine HRMS-PRODUCTION' failed.

===============================================================

Virtual machine migration operation for 'HRMS-PRODUCTION' failed at migration destination 'HP-SRV3'. (Virtual machine ID F7474C27-CF9C-4B62-8975-E0FEFD9C8D8B)

Event Xml:

21502

0

2

0

0

0x2000000000000000

258340

System

HP-SRV1.csaplho.pk

Virtual Machine HRMS-PRODUCTION

HRMS-PRODUCTION

Live migration of 'Virtual Machine HRMS-PRODUCTION' failed.

Virtual machine migration operation for 'HRMS-PRODUCTION' failed at migration destination 'HP-SRV3'. (Virtual machine ID F7474C27-CF9C-4B62-8975-E0FEFD9C8D8B)

=================================================================

Log Name: System

Source: Microsoft-Windows-Hyper-V-High-Availability

Date: 14/02/2025 6:05:41 PM

Event ID: 21502

Task Category: None

Level: Error

Keywords:

User: SYSTEM

Computer: HP-SRV2.csaplho.pk

Description:

Live migration of 'Virtual Machine Ad-Sync' failed.

Virtual machine migration operation for 'Ad-Sync' failed at migration destination 'HP-SRV3'. (Virtual machine ID 776248B9-3683-457E-A90D-4914A9A4BF1B)

Event Xml:

21502

0

2

0

0

0x2000000000000000

193193

System

HP-SRV2.csaplho.pk

Virtual Machine Ad-Sync

Ad-Sync

Live migration of 'Virtual Machine Ad-Sync' failed.

Virtual machine migration operation for 'Ad-Sync' failed at migration destination 'HP-SRV3'. (Virtual machine ID 776248B9-3683-457E-A90D-4914A9A4BF1B)

==================================================================================

Observations :

**All have same settings**********

PS C:\Windows\system32> Get-VMHost | Select-Object ComputerName, VirtualMachineMigrationAuthenticationType

ComputerName VirtualMachineMigrationAuthenticationType


HP-SRV3 CredSSP

PS C:\Windows\system32> Get-ClusterSharedVolume | Select-Object Name, State, OwnerNode

Name State OwnerNode


NOP-R5 Online HP-SRV1

OP-R5 Online HP-SRV2

PS C:\Windows\system32> Get-VMHost | Select-Object ComputerName, VirtualMachineMigrationEnabled, VirtualMachineMigrationAuthenticationType

ComputerName VirtualMachineMigrationEnabled VirtualMachineMigrationAuthenticationType


HP-SRV1 True CredSSP

PS C:\Windows\system32> Get-ItemProperty -Path "HKLM:\SOFTWARE\Microsoft\Windows NT\CurrentVersion" | Select-Object ProductName, DisplayVersion, CurrentBuild, ReleaseId

ProductName DisplayVersion CurrentBuild ReleaseId


Windows Server 2019 Standard 17763 1809

PS C:\Windows\system32> (Get-VMHost).VirtualMachineMigrationEnabled

True

====================================

On HP-SRV3

PS C:\Windows\system32> Get-NetAdapter | Select-Object Name, InterfaceIndex, Status

Name InterfaceIndex Status


ETHMIG2-SRV3 23 Up

V-MIGRATION 22 Up

ETH1-SRV3 21 Up

vEthernet (V-PROD) 17 Up

ETH2-SRV3 9 Up

ETHMIG1-SRV3 8 Up

ETHTEAM-SRV3 3 Up

On HP-SRV2

PS C:\Windows\system32> Get-NetAdapter | Select-Object Name, InterfaceIndex, Status

Name InterfaceIndex Status


vEthernet (V-PROD) 23 Up

ETH2-SRV2 18 Up

ETHMIG1-SRV2 14 Up

V-MIGRATION 11 Up

ETHTEAM-SRV2 5 Up

ETHMIG2-SRV2 4 Up

ETH1-SRV2 2 Up

On HP-SRV1

PS C:\Windows\system32> Get-NetAdapter | Select-Object Name, InterfaceIndex, Status

Name InterfaceIndex Status


ETH1-SRV1 22 Up

ETHTEAM-SRV1 21 Up

ETHMIG2-SRV1 19 Up

vEthernet (V-PROD) 18 Up

ETH2-SRV1 16 Up

ETHMIG1-SRV1 12 Up

V-MIGRATION 6 Up

**All Servers are able to ping each other through V-MIGRATION network

Get-VMHost | Select-Object ComputerName, VirtualMachineMigrationAuthenticationType

****

Please help me how can i resolve these error and Live Migration issues ?

Do i need to change settings for authentication ?

All Server Cluster logs attached in below link

Get-ClusterLog -Node HP-SRV3 -Timespan 10 -UseLocalTime

Cluster Logs

Windows Server High availability Clustering and high availability

Locked Question. This question was migrated from the Microsoft Support Community. You can vote on whether it's helpful, but you can't add comments or replies or follow the question. To protect privacy, user profiles for migrated questions are anonymized.

0 comments No comments
{count} votes

3 answers

Sort by: Most helpful
  1. Anonymous
    2025-02-18T06:59:56+00:00

    Hi,

    I only find some cxl::certstore::iskeyvalid(-2146893802) errors in the cluster log which is related to certificate or key management issues. Open certlm.msc and check for any expired or invalid certificates like CLIUSR under Personal and Trusted Root Certification Authorities. Run the cluster validation and make sure all tests are OK.

    If the issue persists, try evicting the node from the cluster and then add it back. Also try changing the delegation option to "Trust this computer for delegation to any services (Kerberos only)".

    0 comments No comments
  2. Anonymous
    2025-02-18T08:05:55+00:00

    @Ian_Xue,

    Unable to find any expired certificates as you mentioned.

    0 comments No comments
  3. Anonymous
    2025-02-25T06:07:39+00:00

    Dear Experts,

    Please help

    0 comments No comments