Application availability in VMware Engine

Google Cloud VMware Engine provides high-availability features for applications that run on VMware in your private cloud. The following table lists failure scenarios and the availability features that help protect your applications in each case.

| Failure scenario | Application protected? | VMware Engine HA feature | VMware HA feature | Google Cloud feature |
|---|---|---|---|---|
| Disk failure | Yes | Fast replacement of failed node | About the vSAN Default Storage Policy | |
| Fan failure | Yes | Redundant fans, fast replacement of failed node | | |
| NIC failure | Yes | Redundant NIC, fast replacement of failed node | | |
| Host power failure | Yes | Redundant power supply | | |
| ESXi host failure | Yes | Fast replacement of failed node | VMware vSphere High Availability | |
| VM failure | Yes | | VMware vSphere High Availability | Load balancer for stateless VMware VMs |
| Leaf switch port failure | Yes | Redundant NIC | | |
| Leaf switch failure | Yes | Redundant leaf switches | | |
| Rack failure | Yes | Placement groups | | |
| Network connectivity to on-premises | Yes | Redundant networking services | | Redundant Dedicated/Partner Interconnect circuits |
| Network connectivity | Yes | | | Redundant Dedicated/Partner Interconnect circuits |
| Datacenter failure | Yes | | | Availability zones |
| Regional failure | Yes | | | Hosting regions |

Availability features

Fast replacement of a failed node

Control plane software continuously monitors the health of VMware clusters and detects when an ESXi host fails. It then automatically adds a new ESXi host from its pool of readily available nodes to the affected VMware cluster and removes the failed host from the cluster. This quickly restores the cluster's spare capacity, which supports the resiliency that vSAN and VMware vSphere High Availability (HA) provide.
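The replacement flow described above can be illustrated with a small sketch. This is a toy model only, not the actual control plane logic: the `Cluster` class, the `replace_failed_nodes` function, and all host names are hypothetical and exist purely for illustration. The sketch assumes that a replacement host is added before the failed one is removed, and that failures stay in place when the spare pool is empty.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """Toy model of a VMware cluster plus a pool of spare nodes (hypothetical)."""
    hosts: dict[str, bool]  # host name -> True if healthy
    spare_pool: list[str] = field(default_factory=list)


def replace_failed_nodes(cluster: Cluster) -> list[tuple[str, str]]:
    """Swap each unhealthy host for a spare; return (failed, replacement) pairs."""
    replacements = []
    failed = [name for name, healthy in cluster.hosts.items() if not healthy]
    for name in failed:
        if not cluster.spare_pool:
            break  # no spares left; remaining failed hosts stay in the cluster
        new_host = cluster.spare_pool.pop(0)
        cluster.hosts[new_host] = True  # add the replacement host first
        del cluster.hosts[name]         # then take the failed host out
        replacements.append((name, new_host))
    return replacements


cluster = Cluster(
    hosts={"esxi-1": True, "esxi-2": False, "esxi-3": True},
    spare_pool=["spare-a", "spare-b"],
)
print(replace_failed_nodes(cluster))
print(sorted(cluster.hosts))
```

In this model the cluster returns to three healthy hosts after one pass, which mirrors how restoring spare capacity lets vSAN and vSphere HA tolerate a further failure.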

Placement groups

When you create a private cloud, you select a region and a placement group within that region. A placement group is a set of nodes that is spread across multiple racks within the same spine network segment and that always resides in a single availability zone. Nodes in the same placement group can reach each other with a maximum of two extra switch hops. The control plane distributes the nodes of a private cloud across multiple racks on a best-effort basis. Nodes in different placement groups are guaranteed to be placed in different racks.
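To make the idea of best-effort rack distribution concrete, here is a minimal sketch. The actual placement algorithm is not documented here, so the round-robin strategy, the `spread_across_racks` function, and the node and rack names are all assumptions chosen for illustration.

```python
from collections import Counter
from itertools import cycle


def spread_across_racks(nodes: list[str], racks: list[str]) -> dict[str, str]:
    """Assign nodes to racks round-robin so per-rack counts differ by at most one."""
    rack_cycle = cycle(racks)
    return {node: next(rack_cycle) for node in nodes}


placement = spread_across_racks(
    ["node-1", "node-2", "node-3", "node-4", "node-5"],
    ["rack-a", "rack-b", "rack-c"],
)
per_rack = Counter(placement.values())

# The largest per-rack count is the number of nodes lost if that rack fails.
print(placement)
print("worst-case nodes lost to one rack failure:", max(per_rack.values()))
```

Spreading nodes evenly keeps the impact of a single rack failure as small as the rack count allows, which is the property the table relies on for the rack failure scenario.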

Availability zones

VMware Engine private clouds are hosted in a user-selected Google Cloud location. These locations are composed of regions and zones. A region is a specific geographical location where you can host your resources. Each region has one or more zones; most regions have three or more zones.

Resources in different zones in a region are isolated from most types of physical infrastructure and infrastructure software service failures. Resources in different regions have an even higher degree of failure independence. To protect your applications and data from data center failures, design your system so that resources are distributed across different failure domains.
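One way to reason about failure domains is to check whether an application's replicas would survive the loss of any single zone or region. The sketch below is a simplified illustration: the `survives_single_failure` function and the region and zone labels are hypothetical, and the sketch assumes that one surviving replica is enough to keep the application available.

```python
def survives_single_failure(replicas: list[tuple[str, str]], domain: str) -> bool:
    """Return True if a replica remains after losing any one failure domain.

    replicas: (region, zone) pairs, one per replica.
    domain: "zone" or "region".
    """
    if domain == "zone":
        # A zone is identified by its region together with its zone label.
        domains = set(replicas)
    elif domain == "region":
        domains = {region for region, _zone in replicas}
    else:
        raise ValueError(f"unknown failure domain: {domain!r}")
    # Replicas in at least two distinct domains survive the loss of any one.
    return len(domains) >= 2


same_region = [("region-1", "zone-a"), ("region-1", "zone-b")]
print(survives_single_failure(same_region, "zone"))
print(survives_single_failure(same_region, "region"))
```

In this example the two replicas sit in different zones of one region, so the layout tolerates a zone failure but not a regional failure. Placing a replica in a second region would cover both cases.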

Redundant networking services

All the Google networking services for the private cloud (including firewall, public IP addresses, internet, Dedicated Interconnect, Partner Interconnect, and Cloud VPN) are highly available and able to support the SLA.