Facing issue while migrating azure postgres flexi server to GCP

Vaidehi Wadnerkar 0 Reputation points
2025-06-02T11:55:41.8166667+00:00

We are currently migrating data from an Azure Database for PostgreSQL – Flexible Server instance to Google Cloud PostgreSQL using Google’s Database Migration Service (DMS). A HA VPN tunnel has been successfully established between Azure and GCP, with BGP configured to dynamically exchange routes between both networks. The migration initiates without issue, and approximately 90 GB of data is transferred successfully. However, after this point, the DMS job consistently fails due to the BGP session going down. From Azure’s perspective, this indicates that the Azure Virtual Network Gateway is losing connectivity with the BGP peer on the GCP side, resulting in the withdrawal of learned routes and disruption of the VPN tunnel. This could be caused by an issue with BGP keepalive or hold timers, session stability under sustained load, or possible throttling or packet loss at the VPN gateway. We need assistance in identifying whether the Azure VPN Gateway is encountering any errors or resource constraints during the high-volume data transfer, and whether there are logs or diagnostics indicating why the BGP session is being terminated unexpectedly after prolonged traffic flow.

Azure Database for PostgreSQL
{count} votes

1 answer

Sort by: Most helpful
  1. PratikLad 1,585 Reputation points Microsoft External Staff Moderator
    2025-06-09T16:59:21.3266667+00:00

    Hello Vaidehi Wadnerkar

    Use Azure Network Watcher and VPN Gateway Metrics to check for resource constraints or tunnel failures:

    Navigate to: Azure Portal > VPN Gateway > Metrics Correlate the timing of the BGP drop with spikes in traffic or errors.

    • Ensure BGP keepalive and hold timers are consistent between Azure and GCP. Azure uses: Keepalive: 60 seconds, Hold Time: 180 seconds

    A mismatch or overly aggressive configuration on the GCP side may cause session timeout during congestion.

    • Confirm the SKU of your Azure VPN Gateway (e.g., VpnGw1, VpnGw2, etc.) supports your bandwidth needs.

    VpnGw1 supports up to ~650 Mbps. If you're migrating large volumes quickly, you may exceed this limit, causing performance degradation.

    • Consider upgrading to a higher SKU (e.g., VpnGw3 or VpnGw5) to increase performance headroom.

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.