About accessing Vertex AI services through private services access

If a Vertex AI service has a checkmark in the Private services access column of the Private access options for Vertex AI table, you must connect to that service through private services access.

These Google-managed Vertex AI services support bidirectional communication with a service consumer's on-premises, multicloud, and VPC workloads.

This communication uses internal IP addresses exclusively. VM instances don't need internet access or external IP addresses to reach services that are available through private services access.

Vertex AI provides services that are hosted in a Google-managed VPC network. Private services access lets you reach the internal IP addresses of these Vertex AI and third-party services through a VPC Network Peering connection.

The following diagram shows a custom training architecture in which Vertex AI APIs for training jobs and pipeline jobs are enabled and managed in a service project (serviceproject) as part of a Shared VPC deployment. These components are deployed as Google-managed infrastructure as a service (IaaS) in the service producer's VPC network. The service consumer's VPC network (hostproject) accesses these services through a private services access connection.

Private services access deployment options

You can create a new private connection or modify an existing one. Before you configure private services access, understand the considerations for choosing a VPC network and IP address range.

To create a new private connection, you must first create an allocated IP range and then create a private connection between your VPC network and Google-managed Vertex AI services.

Alternatively, you can modify an existing connection. For more information, see Modify a private connection.
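The two steps for creating a new connection (allocate a range, then create the connection) can be sketched as request bodies. This is a minimal sketch, not a definitive implementation: it only builds the dictionaries that would be sent to the Compute Engine globalAddresses.insert and Service Networking services.connections.create REST methods, and it sends nothing. The function names, the range name, and the network paths are hypothetical placeholders.

```python
# Sketch only: builds request bodies and makes no API calls.
# Function names and all example values are hypothetical placeholders.

def allocated_range_body(name, network, prefix_length):
    """Body for a Compute Engine globalAddresses.insert request that
    reserves an internal range for private services access."""
    return {
        "name": name,
        "purpose": "VPC_PEERING",      # marks the range as allocated for peering
        "addressType": "INTERNAL",
        "prefixLength": prefix_length,  # for example, 16 for a /16 range
        "network": network,
    }


def private_connection_body(network, range_names):
    """Body for a Service Networking services.connections.create request
    that connects the VPC network to the service producer."""
    return {
        "network": network,
        "reservedPeeringRanges": list(range_names),
    }


if __name__ == "__main__":
    # Step 1: allocate the range in the consumer's VPC network.
    print(allocated_range_body(
        "vertex-psa-range",
        "projects/hostproject/global/networks/example-vpc",
        16,
    ))
    # Step 2: create the private connection that uses the allocated range.
    # Assumption: the network path here identifies the project by number.
    print(private_connection_body(
        "projects/123456789/global/networks/example-vpc",
        ["vertex-psa-range"],
    ))
```

Keeping the bodies separate from the transport makes it easier to review the allocated range and prefix length before anything is created.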

Vertex AI subnet recommendations

The following table lists the recommended subnet ranges for Vertex AI services.

Vertex AI feature	Recommended subnet range
Managed notebook instances	/29
Vertex AI Pipelines	/21
Custom training jobs	/19
Vector Search online queries	/16
Online prediction private endpoints	/21
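
To make the prefix lengths in the table concrete, the following sketch computes how many IPv4 addresses each recommended range contains and checks a candidate range for overlap with ranges already in use. It uses only the Python standard library. The helper names and the example CIDR blocks are illustrative assumptions, not values from Vertex AI.

```python
import ipaddress

# Prefix lengths copied from the table above.
RECOMMENDED_PREFIXES = {
    "Managed notebook instances": 29,
    "Vertex AI Pipelines": 21,
    "Custom training jobs": 19,
    "Vector Search online queries": 16,
    "Online prediction private endpoints": 21,
}


def addresses_in_prefix(prefix_length):
    """Total number of IPv4 addresses in a block with this prefix length."""
    return 2 ** (32 - prefix_length)


def overlaps_any(candidate, existing):
    """True if the candidate CIDR block overlaps any block in `existing`."""
    cand = ipaddress.ip_network(candidate)
    return any(cand.overlaps(ipaddress.ip_network(e)) for e in existing)


if __name__ == "__main__":
    for feature, prefix in RECOMMENDED_PREFIXES.items():
        print(f"{feature}: /{prefix} = {addresses_in_prefix(prefix)} addresses")
    # Hypothetical ranges already used on-premises or in the VPC network.
    in_use = ["10.10.4.0/24", "192.168.0.0/24"]
    print(overlaps_any("10.10.0.0/16", in_use))
    print(overlaps_any("10.20.0.0/16", in_use))
```

A check like this helps when choosing an allocated range, because the range must not collide with addresses that are routed elsewhere in your environment.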

Deployment considerations

The following considerations affect how you establish communication between your on-premises, multicloud, and VPC workloads and Google-managed Vertex AI services.

IP advertisement

You must advertise the private services access subnet range from the Cloud Router as a custom advertised route. For more information, see Advertise custom IP ranges.
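
As an illustration, the custom advertisement can be expressed as a patch body for the Cloud Router resource. The sketch below assumes the Compute Engine Router REST fields bgp.advertiseMode, bgp.advertisedGroups, and bgp.advertisedIpRanges. The function name and the example range are hypothetical, and the code only builds the body without calling any API.

```python
import ipaddress


def custom_advertisement_patch(psa_ranges, keep_subnets=True):
    """Build a Cloud Router patch body that advertises the private
    services access ranges as custom IP ranges.

    Raises ValueError if a range is not a valid CIDR block.
    """
    advertised = []
    for cidr in psa_ranges:
        network = ipaddress.ip_network(cidr)  # rejects blocks with host bits set
        advertised.append({
            "range": str(network),
            "description": "private services access",
        })
    return {
        "bgp": {
            "advertiseMode": "CUSTOM",
            # Keep advertising the VPC subnets alongside the custom ranges.
            "advertisedGroups": ["ALL_SUBNETS"] if keep_subnets else [],
            "advertisedIpRanges": advertised,
        }
    }


if __name__ == "__main__":
    # Hypothetical allocated range for private services access.
    print(custom_advertisement_patch(["10.100.0.0/16"]))
```

Validating each range before building the body catches typos such as 10.100.0.1/16 early, before the advertisement reaches your on-premises routers.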

VPC Network Peering

The service producer's network might not have the correct routes to direct traffic to your on-premises network. By default, the service producer's network only learns the subnet routes from your VPC network. Therefore, any request that's not from a subnet IP range is dropped by the service producer.

For this reason, you must update the peering connection in your VPC network to export custom routes to the service producer's network. Exporting routes sends all eligible static and dynamic routes that are in your VPC network, such as routes to your on-premises network, to the service producer's network. The service producer's network imports them automatically and can then send traffic back to your on-premises network through the VPC network.
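
The effect of exporting custom routes can be illustrated with a toy model. This is a simplified sketch of the behavior described above, not a representation of how Google Cloud routing is implemented: the producer network is assumed to know your subnet routes always, and your custom routes only when export is enabled. The function name and all addresses are hypothetical.

```python
import ipaddress


def producer_can_reply(client_ip, subnet_routes, custom_routes, export_custom_routes):
    """Return True if the modeled producer network has a route back to client_ip."""
    known_routes = list(subnet_routes)      # subnet routes are learned by default
    if export_custom_routes:
        known_routes += list(custom_routes)  # static and dynamic routes, once exported
    ip = ipaddress.ip_address(client_ip)
    return any(ip in ipaddress.ip_network(route) for route in known_routes)


if __name__ == "__main__":
    subnets = ["10.0.1.0/24"]        # hypothetical VPC subnet
    on_prem = ["192.168.0.0/16"]     # hypothetical route learned from on-premises
    # A client in the VPC subnet is reachable with or without export.
    print(producer_can_reply("10.0.1.5", subnets, on_prem, False))
    # An on-premises client is reachable only after custom routes are exported.
    print(producer_can_reply("192.168.10.5", subnets, on_prem, False))
    print(producer_can_reply("192.168.10.5", subnets, on_prem, True))
```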

Firewall rules

In the VPC network that connects your on-premises and multicloud environments to Google Cloud, you must update the firewall rules to allow ingress traffic from, and egress traffic to, the private services access subnets.
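
A pair of rules that matches this requirement might look like the following sketch. It builds bodies shaped like the Compute Engine Firewall REST resource (direction, sourceRanges, destinationRanges, allowed) and calls no API. The function name, rule names, network path, protocol, and port are hypothetical and should be replaced with what your workloads actually need.

```python
def psa_firewall_rules(network, psa_ranges, protocol="tcp", ports=None):
    """Build one ingress and one egress allow rule for the private
    services access ranges. Returns a list of two rule bodies."""
    allowed = {"IPProtocol": protocol}
    if ports:
        allowed["ports"] = [str(p) for p in ports]

    ingress = {
        "name": "allow-ingress-from-psa",      # hypothetical rule name
        "network": network,
        "direction": "INGRESS",
        "sourceRanges": list(psa_ranges),      # traffic coming from the services
        "allowed": [dict(allowed)],
    }
    egress = {
        "name": "allow-egress-to-psa",         # hypothetical rule name
        "network": network,
        "direction": "EGRESS",
        "destinationRanges": list(psa_ranges),  # traffic going to the services
        "allowed": [dict(allowed)],
    }
    return [ingress, egress]


if __name__ == "__main__":
    rules = psa_firewall_rules(
        "projects/hostproject/global/networks/example-vpc",
        ["10.100.0.0/16"],
        ports=[443],
    )
    for rule in rules:
        print(rule)
```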