Network bandwidth

Google Cloud accounts for bandwidth per compute instance, not per virtual network interface (vNIC) or IP address. An instance's machine type defines its maximum possible egress rate; however, you can only achieve that maximum possible egress rate in specific situations.

This page outlines the network bandwidth limits, which are useful when planning your deployments. It categorizes bandwidth using two dimensions:

Egress or ingress: As used on this page, egress and ingress are always from the perspective of a Google Cloud instance:
- Packets sent from a Google Cloud instance compose its egress (outbound) traffic.
- Packets sent to a Google Cloud instance compose its ingress (inbound) traffic.
How the packet is routed: A packet can be routed from a sending instance or to a receiving instance using routes whose next hops are within a VPC network or routes outside of a VPC network.

All of the information on this page is applicable to Compute Engine compute instances, as well as products that depend on Compute Engine instances. For example, a Google Kubernetes Engine node is a Compute Engine instance.

Configurations that impact network bandwidth

Neither additional virtual network interfaces (vNICs) nor additional IP addresses per vNIC increase ingress or egress bandwidth for a compute instance. For example, a C3 VM with 22 vCPUs is limited to 23 Gbps total egress bandwidth. If you configure the C3 VM with two vNICs, the VM is still limited to 23 Gbps total egress bandwidth, not 23 Gbps bandwidth per vNIC.

The following sections describe how other compute instance configurations can impact network bandwidth.

Use per VM Tier_1 networking performance

To get the highest possible ingress and egress bandwidth, configure Tier_1 networking for your compute instance.

Dynamic Network Interfaces

Dynamic Network Interfaces (Preview) use the bandwidth of their parent vNIC. There is no traffic isolation within a parent vNIC. Network traffic from a Dynamic NIC can starve the other Dynamic NICs associated with the same parent vNIC. To avoid this conflict, you can use Linux traffic control (TC) to craft application-specific traffic shaping policies. These policies help to implement either fairness or certain types of priority. For prioritization, you map traffic (for example for Dynamic NICs) to a traffic class, and then map that traffic class to a quality of service. For an example of this approach, see Traffic shaping with Red Hat.

Dynamic NICs aren't supported for Compute Engine instances that run a Windows OS.

HPC computing with RDMA

Bandwidth is required for all three stages of a high performance computing (HPC) job: load, compute and store. During the load and store phases, applications commonly use TCP for communication, and remote direct memory address (RDMA) for the compute stages. H4D instances using GVNIC for the TCP traffic provide up to 200 Gbps network bandwidth. If you also configure an H4D instance to use Cloud RDMA, then the network bandwidth is shared among the configured RDMA network interfaces.

Network bandwidth allocation between Cloud RDMA traffic and TCP traffic is done dynamically. Instead of limiting both Cloud RDMA and TCP traffic to 100 Gbps bandwidth each, the GVNIC network interface can use all of the available bandwidth when the RDMA network interface is not being used. Similarly, the RDMA network interface can use all of the available bandwidth when the GVNIC network interface is not being used. When both network interface types are in use, RDMA traffic has priority over TCP traffic because it is more performance sensitive.

Bandwidth summary

The following table illustrates the maximum possible bandwidth based on whether a packet is sent from (egress) or received by (ingress) a compute instance and the packet routing method.

Egress bandwidth limits

Routing within a VPC network	Primarily defined by a per-instance maximum egress bandwidth based on the sending instance's machine type and whether Tier_1 networking is enabled. N2, N2D, C2, C2D, and C4A VMs with Tier_1 networking support egress bandwidth limits up to 100 Gbps. H3 VMs support VM-to-VM egress bandwidth limits up to 200 Gbps. H4D instances (Preview) support VM-to-VM egress bandwidth of up to 200 Gbps for RDMA and gVNIC combined. X4, M4, A2, and G2 instances support egress bandwidth limits up to 100 Gbps. G4 instances support egress bandwidth limits up to 400 Gbps. A4X instances support egress bandwidth limits up to 2,000 Gbps. A4 and A3 instances support egress bandwidth limits up to 3,600 Gbps. C4, C4D, C3, C3D, and Z3 instances support up to 200 Gbps egress bandwidth limits with Tier_1 networking. For other factors, definitions, and scenarios, see Egress to destinations routable within a VPC network.
Routing outside a VPC network	Primarily defined by a per-instance maximum egress bandwidth based on the sending instance's machine type and whether Tier_1 networking is enabled. Except for H3 VMs, a sending instance's maximum possible egress to a destination outside of its VPC network cannot exceed the following: 7 Gbps total when Tier_1 networking isn't enabled 25 Gbps total when Tier_1 networking is enabled 3 Gbps per flow For other factors, definitions, and caveats, see Egress to destinations outside of a VPC network.

Routing within
a VPC network

Primarily defined by a per-instance maximum egress bandwidth based on the sending instance's machine type and whether Tier_1 networking is enabled.
- N2, N2D, C2, C2D, and C4A VMs with Tier_1 networking support egress bandwidth limits up to 100 Gbps.
- H3 VMs support VM-to-VM egress bandwidth limits up to 200 Gbps.
- H4D instances (Preview) support VM-to-VM egress bandwidth of up to 200 Gbps for RDMA and gVNIC combined.
- X4, M4, A2, and G2 instances support egress bandwidth limits up to 100 Gbps.
- G4 instances support egress bandwidth limits up to 400 Gbps.
- A4X instances support egress bandwidth limits up to 2,000 Gbps.
- A4 and A3 instances support egress bandwidth limits up to 3,600 Gbps.
- C4, C4D, C3, C3D, and Z3 instances support up to 200 Gbps egress bandwidth limits with Tier_1 networking.
For other factors, definitions, and scenarios, see Egress to destinations routable within a VPC network.

Routing outside
a VPC network

Primarily defined by a per-instance maximum egress bandwidth based on the sending instance's machine type and whether Tier_1 networking is enabled. Except for H3 VMs, a sending instance's maximum possible egress to a destination outside of its VPC network cannot exceed the following:
- 7 Gbps total when Tier_1 networking isn't enabled
- 25 Gbps total when Tier_1 networking is enabled
- 3 Gbps per flow
For other factors, definitions, and caveats, see Egress to destinations outside of a VPC network.

Ingress bandwidth limits

Routing within a VPC network	Generally, ingress rates are similar to the egress rates for a machine type. To get the highest possible ingress bandwidth, enable Tier_1 networking. The size of your compute instance, the capacity of the server NIC, the traffic coming into other guest VMs running on the same host hardware, your guest OS network configuration, and the number of disk reads performed by your instance can all impact the ingress rate. Google Cloud doesn't impose any additional limitations on ingress rates within a VPC network. For other factors, definitions, and scenarios, see Ingress to destinations routable within a VPC network.
Routing outside a VPC network	Google Cloud protects each compute instance by limiting ingress traffic routed outside a VPC network. The limit is the first of the following rates encountered: 1,800,000 pps (packets per second) 30 Gbps For a machine series that supports multiple physical NICs, such as A3, the limit is the first of the following rates encountered: 1,800,000 pps (packets per second) per physical NIC 30 Gbps per physical NIC For other factors, definitions, and scenarios, see Ingress to destinations outside of a VPC network.

Routing within
a VPC network

Generally, ingress rates are similar to the egress rates for a machine type.
To get the highest possible ingress bandwidth, enable Tier_1 networking.
The size of your compute instance, the capacity of the server NIC, the traffic coming into other guest VMs running on the same host hardware, your guest OS network configuration, and the number of disk reads performed by your instance can all impact the ingress rate.
Google Cloud doesn't impose any additional limitations on ingress rates within a VPC network.
For other factors, definitions, and scenarios, see Ingress to destinations routable within a VPC network.

Routing outside
a VPC network

Google Cloud protects each compute instance by limiting ingress traffic routed outside a VPC network. The limit is the first of the following rates encountered:
- 1,800,000 pps (packets per second)
- 30 Gbps
For a machine series that supports multiple physical NICs, such as A3, the limit is the first of the following rates encountered:
- 1,800,000 pps (packets per second) per physical NIC
- 30 Gbps per physical NIC
For other factors, definitions, and scenarios, see Ingress to destinations outside of a VPC network.

Egress bandwidth

Google Cloud limits outbound (egress) bandwidth using per-instance maximum egress rates. These rates are based the machine type of the compute instance that is sending the packet and whether the packet's destination is accessible using routes within a VPC network or routes outside of a VPC network. Outbound bandwidth includes packets emitted by all of the instance's NICs and data transferred to all Hyperdisk and Persistent Disk volumes connected to the instance.

Per-instance maximum egress bandwidth

Per-instance maximum egress bandwidth is generally 2 Gbps per vCPU, but there are some differences and exceptions, depending on the machine series. The following table shows the range of maximum limits for egress bandwidth for traffic routed within a VPC network.

The following table summarizes the maximum egress bandwidth for each machine series. You can find the per-instance maximum egress bandwidth for every machine type listed on its specific machine family page (using the links for each machine series in the table).

	Maximum per-instance egress limit
Machine series	Standard	Tier_1 networking
C4, and C4D	100 Gbps	200 Gbps
C4A	50 Gbps	100 Gbps
C3 and C3D	100 Gbps	200 Gbps
C2 and C2D	32 Gbps	100 Gbps
E2	16 Gbps	N/A
E2 shared-core	2 Gbps	N/A
H4D and H3	200 Gbps	N/A
M4	100 Gbps	N/A
M3	32 Gbps	100 Gbps
M2	32 Gbps on Intel Cascade Lake or later CPU 16 Gbps on other CPU platforms	N/A
M1	32 Gbps	N/A
N4	50 Gbps	N/A
N2 and N2D	32 Gbps	100 Gbps
N1 (excluding VMs with 1 vCPU)	32 Gbps on Intel Skylake and later CPU 16 Gbps on earlier CPU platforms	N/A
N1 machine types with 1 vCPU	2 Gbps	N/A
N1 shared-core (f1-micro and g1-small)	1 Gbps	N/A
T2A and T2D	32 Gbps	N/A
X4	100 Gbps	N/A
Z3	100 Gbps	200 Gbps

For network bandwidth information for Accelerator-optimized machine series, see Networking and GPU machines.

Per-instance maximum egress bandwidth is not a guarantee. The actual egress bandwidth can be lowered according to factors such as the following non-exhaustive list:

Using VirtIO instead of gVNIC with compute instances that support both
Packet size
Protocol overhead
The number of flows
Ethernet driver settings of the compute instance's guest OS, such as checksum offload and TCP segmentation offload (TSO)
Network congestion
In a situation where Persistent Disk I/Os compete with other network egress traffic, 60% of the maximum network bandwidth is given to Persistent Disk writes, leaving 40% for other network egress traffic. See Factors that affect disk performance for more details.

To get the largest possible per-instance maximum egress bandwidth:

Enable per VM Tier_1 networking performance with larger machine types.
Use the largest VPC network maximum transmission unit (MTU) supported by your network topology. Larger MTUs can reduce packet-header overhead and increase payload data throughput.
Use the latest gVNIC driver version.
Use third generation or later machine series that use Titanium to offload network processing from the host CPU.

Egress to destinations routable within a VPC network

From the perspective of a sending instance and for destination IP addresses accessible by means of routes within a VPC network, Google Cloud limits outbound traffic using these rules:

Per-VM maximum egress bandwidth: The per-instance maximum egress bandwidth described in the Per-instance maximum egress bandwidth section.
Per-project inter-regional egress bandwidth: If a sending instance and an internal destination or its next hop are in different regions, Google Cloud enforces a quota based on the region and project of the sending instance and the region of the internal destination or next hop. For more information about this quota, see Inter-region network egress bandwidth (Mbps) from Compute instances in the VPC quotas and limits documentation.
Cloud VPN and Cloud Interconnect limits: When sending traffic from an instance to an internal IP address destination routable by a next hop Cloud VPN tunnel or Cloud Interconnect VLAN attachment, egress bandwidth is limited by:
- Maximum packet rate and bandwidth per Cloud VPN tunnel
- Maximum packet rate and bandwidth per VLAN attachment
- To fully use the bandwidth of multiple next hop Cloud VPN tunnels or Cloud Interconnect VLAN attachments using ECMP routing, you must use multiple TCP connections (unique 5-tuples).

Destinations routable within a VPC network include all of the following destinations, each of which is accessible from the perspective of the sending instance by a route whose next hop is not the default internet gateway:

Regional internal IPv4 addresses in subnet primary IPv4 and subnet secondary IPv4 address ranges, including private IPv4 address ranges and privately used public IPv4 address ranges, used by these destination resources:
- The primary internal IPv4 address of a receiving instance's network interface (vNIC). (When a sending instance connects to another instance's vNIC external IPv4 address, packets are routed using a next hop default internet gateway, so Egress to destinations outside of a VPC network applies instead.)
- An internal IPv4 address in an alias IP range of a receiving instance's vNIC.
- An internal IPv4 address of an internal forwarding rule for either protocol forwarding or for an internal passthrough Network Load Balancer.
Global internal IPv4 addresses for these destination resources:
- Private Service Connect endpoints for Google Cloud APIs
- Allocated ranges for private services access
Internal IPv6 subnet address ranges used by these destination resources:
- An IPv6 address from the /96 IPv6 address range assigned to a dual-stack or IPv6-only receiving instance's vNIC.
- An IPv6 address from the /96 IPv6 address range of an internal forwarding rule for either protocol forwarding or for an internal passthrough Network Load Balancer.
External IPv6 subnet address ranges used by these destination resources when packets are routed using subnet routes or peering subnet routes within the VPC network or by custom routes within the VPC network that do not use the default internet gateway next hop:
- An IPv6 address from the /96 IPv6 address range assigned to a dual-stack or IPv6-only receiving instance's vNIC.
- An IPv6 address from the /96 IPv6 address range of an external forwarding rule for either protocol forwarding or for an external passthrough Network Load Balancer.
Other destinations accessible using the following VPC network routes:
- Dynamic routes
- Static routes except those that use a default internet gateway next hop
- Peering custom routes

The following list ranks traffic from sending instances to internal destinations, from highest possible bandwidth to lowest:

Between compute instances in the same zone
Between compute instances in different zones of the same region
Between compute instances in different regions
From a compute instance to Google Cloud APIs and services using Private Google Access or accessing Google APIs from an instance's external IP address. This includes Private Service Connect endpoints for Google APIs.

Egress to destinations outside of a VPC network

From the perspective of a sending instance and for destination IP addresses outside of a VPC network, Google Cloud limits outbound traffic to whichever of the following rates is reached first:

Per-instance egress bandwidth: The maximum bandwidth for all connections from a compute instance to destinations outside of a VPC network is the smaller of the Per-instance maximum egress bandwidth and one of these rates:
- 25 Gbps, if Tier_1 networking is enabled
- 7 Gbps, if Tier_1 networking isn't enabled
- 1 Gbps for H4D and H3 instances
- 7 Gbps per physical NIC for machine series that support multiple physical NICs, such as A3.
Note: Maximum egress bandwidth can be achieved when the packet size matches the default MTU value of 1,460 bytes. As the packet size decreases, the maximum achievable bandwidth also declines due to limitations on the number of packets per second the VM can handle.

For example, even though an c3-standard-44 instance has a per-VM maximum egress bandwidth of 32 Gbps, the per-VM egress bandwidth from a c3-standard-44 VM to external destinations is either 25 Gbps or 7 Gbps, depending on whether Tier_1 networking is enabled.
Per-flow maximum egress rate: The maximum bandwidth for each unique 5-tuple connection, from a compute instance to a destination outside of a VPC network is 3 Gbps, except on H4D and H3, where it is 1 Gbps.
Per-project internet egress bandwidth: The maximum bandwidth for all connections from compute instances in each region of a project to destinations outside of a VPC network is defined by the project's Internet egress bandwidth quotas.

Destinations outside of a VPC network include all of the following destinations, each of which is accessible by a route in the sending instance's VPC network whose next hop is the default internet gateway:

Global external IPv4 and IPv6 addresses for external proxy Network Load Balancers and external Application Load Balancers
Regional external IPv4 addresses for Google Cloud resources, including VM vNIC external IPv4 addresses, external IPv4 addresses for external protocol forwarding, external passthrough Network Load Balancers, and response packets to Cloud NAT gateways.
Regional external IPv6 addresses in dual-stack or IPv6-only subnets with external IPv6 address ranges used by external IPv6 addresses of dual-stack or IPv6-only instances, external protocol forwarding, and external passthrough Network Load Balancers. The subnet must be located in a separate, non-peered VPC network. The destination IPv6 address range must be accessible using a route in the sending instance's VPC network whose next hop is the default internet gateway. If a dual-stack or IPv6-only subnet with an external IPv6 address range is located in the same VPC network or in a peered VPC network, see Egress to destinations routable within a VPC network instead.
Other external destinations accessible using a static route in the sending instance's VPC network provided that the next hop for the route is the default internet gateway.

For details about which Google Cloud resources use what types of external IP addresses, see External IP addresses.

Ingress bandwidth

Google Cloud handles inbound (ingress) bandwidth depending on how the incoming packet is routed to a receiving compute instance.

Ingress to destinations routable within a VPC network

A receiving instance can handle as many incoming packets as its machine type, operating system, and other network conditions permit. Google Cloud does not implement any purposeful bandwidth restriction on incoming packets delivered to an instance if the incoming packet is delivered using routes within a VPC network:

Subnet routes in the receiving instance's VPC network
Peering subnet routes in a peered VPC network
Routes in another network whose next hops are Cloud VPN tunnels, Cloud Interconnect (VLAN) attachments, or Router appliance instances located in the receiving instance's VPC network

Destinations for packets that are routed within a VPC network include:

The primary internal IPv4 address of the receiving instance's network interface (NIC). Primary internal IPv4 addresses are regional internal IPv4 addresses that come from a subnet's primary IPv4 address range.
An internal IPv4 address from an alias IP range of the receiving instance's NIC. Alias IP ranges can come from either a subnet's primary IPv4 address range or one of its secondary IPv4 address ranges.
An IPv6 address from the /96 IPv6 address range assigned to a dual-stack or IPv6-only receiving instance's NIC. Compute instance IPv6 ranges can come from these subnet IPv6 ranges:
- An internal IPv6 address range.
- An external IPv6 address range when the incoming packet is routed internally to the receiving instance using one of the VPC network routes listed previously in this section.
An internal IPv4 address of a forwarding rule used by internal protocol forwarding to the receiving instance or internal passthrough Network Load Balancer where the receiving instance is a backend of the load balancer. Internal forwarding rule IPv4 addresses come from a subnet's primary IPv4 address range.
An internal IPv6 address from the /96 IPv6 range of a forwarding rule used by internal protocol forwarding to the receiving instance or internal passthrough Network Load Balancer where the receiving instance is a backend of the load balancer. Internal forwarding rule IPv6 addresses come from a subnet's internal IPv6 address range.
An external IPv6 address from the /96 IPv6 range of a forwarding rule used by external protocol forwarding to the receiving instance or external passthrough Network Load Balancer. The receiving instance is a backend of the load balancer when the incoming packet is routed within the VPC network using one of the routes listed previously in this section. External forwarding rule IPv6 addresses come from a subnet's external IPv6 address range.
An IP address within the destination range of a custom static route that uses the receiving instance as a next hop instance (next-hop-instance or next-hop-address).
An IP address within the destination range of a custom static route using an internal passthrough Network Load Balancer (next-hop-ilb) next hop, if the receiving instance is a backend for that load balancer.

Ingress to destinations outside of a VPC network

Google Cloud implements the following bandwidth limits for incoming packets delivered to a receiving instance using routes outside a VPC network. When load balancing is involved, the bandwidth limits are applied individually to each receiving instance.

For machine series that don't support multiple physical NICs, the applicable inbound bandwidth restriction applies collectively to all virtual network interfaces (vNICs). The limit is the first of the following rates encountered:

1,800,000 packets per second
30 Gbps

For machine series that support multiple physical NICs, such as A3, the applicable inbound bandwidth restriction applies individually to each physical NIC. The limit is the first of the following rates encountered:

1,800,000 packets per second per physical NIC
30 Gbps per physical NIC

Destinations for packets that are routed using routes outside of a VPC network include:

An external IPv4 address assigned in a one-to-one NAT access configuration on one of the receiving instance's network interfaces (NICs).
An external IPv6 address from the /96 IPv6 address range assigned to a vNIC of a dual-stack or IPv6-only receiving instance when the incoming packet is routed using a route outside of the receiving instance's VPC network.
An external IPv4 address of a forwarding rule used by external protocol forwarding to the receiving instance or external passthrough Network Load Balancer where the receiving instance is a backend of the load balancer.
An external IPv6 address from the /96 IPv6 range of a forwarding rule used by external protocol forwarding to the receiving instance or external passthrough Network Load Balancer. The receiving instance must be a backend of the load balancer when the incoming packet is routed using a route outside of a VPC network.
Established inbound responses processed by Cloud NAT.

Use jumbo frames to maximize network bandwidth

To receive and send jumbo frames, configure the VPC network used by your compute instances; set the maximum transmission unit (MTU) to a larger value, up to 8896.

Higher MTU values increase the packet size and reduce the packet-header overhead, which increases payload data throughput.

You can use jumbo frames with the gVNIC driver version 1.3 or later on VM instances, or with the IDPF driver on bare metal instances. Not all Google Cloud public images include these drivers. For more information about operating system support for jumbo frames, see the Networking features tab on the Operating system details page.

If you are using an OS image that doesn't have full support for jumbo frames, you can manually install gVNIC driver version v1.3.0 or later. Google recommends installing the gVNIC driver version marked Latest to benefit from additional features and bug fixes. You can download the gVNIC drivers from GitHub.

To manually update the gVNIC driver version in your guest OS, see Use on non-supported operating systems.

Jumbo frames and GPU machines

For GPU machine types, use the recommended MTU settings for Jumbo frames. For more information, see Recommended MTU settings for Jumbo frames.

Receive and transmit queues

Each NIC or vNIC for a compute instance is assigned a number of receive and transmit queues for processing packets from the network.

Receive Queue (RX): Queue to receive packets. When the NIC receives a packet from the network, the NIC selects the descriptor for an incoming packet from the queue, processes it and hands the packet to the guest OS over a packet queue attached to a vCPU core using an interrupt. If the RX queue is full and there is no buffer available to place a packet, then the packet is dropped. This can typically happen if an application is over-utilizing a vCPU core that is also attached to the selected packet queue.
Transmit Queue (TX): Queue to transmit packets. When the guest OS sends a packet, a descriptor is allocated and placed in the TX queue. The NIC then processes the descriptor and transmits the packet.

Default queue allocation

Unless you explicitly assign queue counts for NICs, you can model the algorithm Google Cloud uses to assign a fixed number of RX and TX queues per NIC in this way:

Bare metal instances

For bare metal instances, there is only one NIC, so the maximum queue count is 16.

VM instances that use the gVNIC network interface

For C4 instances, to improve performance, the following configurations use a fixed number of queues:

For Linux instances with 2 vCPUs, the queue count is 1.
For Linux instances with 4 vCPUs, the queue count is 2.

For the other machine series, the queue count depends on whether the machine series uses Titanium or not.

For third generation (excluding M3) and later instances that use Titanium:

Divide the number of vCPUs by the number of vNICs (num_vcpus/num_vnics) and discard any remainder.
For first and second generation VMs that don't use Titanium:

Divide the number of vCPUs by the number of vNICs, and then divide the result by 2 (num_vcpus/num_vnics/2). Discard any remainder.

To finish the default queue count calculation:

If the calculated number is less than 1, assign each vNIC one queue instead.
Determine if the calculated number is greater than the maximum number of queues per vNIC, which is 16. If the calculated number is greater than 16, ignore the calculated number, and assign each vNIC 16 queues instead.

VM instances using the VirtIO network interface or a custom driver

Divide the number of vCPUs by the number of vNICs, and discard any remainder —[number of vCPUs/number of vNICs].

If the calculated number is less than 1, assign each vNIC one queue instead.
Determine if the calculated number is greater than the maximum number of queues per vNIC, which is 32. If the calculated number is greater than 32, ignore the calculated number, and assign each vNIC 32 queues instead.

H4D instances with RDMA

For H4D instance that uses Cloud RDMA, because of the limited machine types that are available for H4D, each host server has a single compute instance. Thus the instance gets all the available queue pairs.

Examples

The following examples show how to calculate the default number of queues for a VM instance:

If a VM instance uses VirtIO and has 16 vCPUs and 4 vNICs, the calculated number is [16/4] = 4. Google Cloud assigns each vNIC four queues.
If a VM instance uses gVNIC and has 128 vCPUs and two vNICs, the calculated number is [128/2/2] = 32. Google Cloud assigns each vNIC the maximum number of queues per vNIC possible. Google Cloud assigns 16 queues per vNIC.

On Linux systems, you can use ethtool to configure a vNIC with fewer queues than the number of queues Google Cloud assigns per vNIC.

Queue counts when using Dynamic Network Interface

If you use Dynamic Network Interfaces with your network interfaces, the queue counts don't change. A Dynamic NIC doesn't have its own receive and transmit queues; it uses the same queues as the parent vNIC.

Custom queue allocation for VM instances

Instead of the default queue allocation, you can assign a custom queue count (total of both RX and TX) to each vNIC when you create a new compute instance by using the Compute Engine API.

The number of custom queues you specify must adhere to the following rules:

The minimum queue count you can assign per vNIC is one.
The maximum queue count you can assign to each vNIC of a VM instance is the lower of the vCPU count or the per vNIC maximum queue count, based on the driver type:
- Using virtIO or a custom driver, the maximum queue count is 32.
- Using gVNIC, the maximum queue count is 16, except for the following, where the maximum queue count is 32:
  - A2 or G2 instances
  - TPU instances
  - C2, C2D, N2, or N2D instances with Tier_1 networking enabled
- For the following Confidential VM configurations, the maximum queue count is 8:
  - AMD SEV on C2D and N2D machine types
  - AMD SEV-SNP on N2D machine types
If you assign custom queue counts to all NICs of the compute instance, the sum of your queue count assignments must be less than or equal to the number of vCPUs assigned to the instance.

You can oversubscribe the custom queue count for your vNICs. In other words, you can have a sum of the queue counts assigned to all NICs for your VM instance that is greater than the number of vCPUs for your instance. To oversubscribe the custom queue count, the VM instance must satisfy the following conditions:

Use gVNIC as the vNIC type for all NICs configured for the instance.
Uses a machine type that supports Tier_1 networking.
Has Tier_1 networking enabled.
Specified a custom queue count for all NICs configured for the instance.

With queue oversubscription, the maximum queue count for the VM instance is 16 times the number of NICs. So, if you have 6 NICs configured for an instance with 30 vCPUs, you can configure a maximum of (16 * 6), or 96 custom queues for your instance.

Examples

If a VM instance has 8 vCPUs and 3 vNICs, the maximum queue count for the instance is the number of vCPUS, or 8. You can assign 1 queue to nic0, 4 queues to nic1, and 3 queues to nic2. In this example, you can't subsequently assign 4 queues to nic2 while keeping the other two vNIC queue assignments because the sum of assigned queues cannot exceed the number of vCPUs.
If you have a N2 VM with 96 vCPUs and 2 vNICs, you can assign both vNICs up to 32 queues each when using the virtIO driver, or up to 16 queues each when using the gVNIC driver. If you enable Tier_1 networking for the N2 VM, then you can assign up to 32 queues to each vNIC. In this example, the sum of assigned queues is always less than or equal to the number of vCPUs.

It's also possible to assign a custom queue count for only some NICs, letting Google Cloud assign queues to the remaining NICs. The number of queues you can assign per vNIC is still subject to rules mentioned previously. You can model the feasibility of your configuration, and, if your configuration is possible, the number of queues that Google Cloud assigns to the remaining vNICs with this process:

Calculate the sum of queues for the vNICs using custom queue assignment. For an example VM with 20 vCPUs and 6 vNICs, suppose you assign nic0 5 queues, nic1 6 queues, nic2 4 queues, and let Google Cloud assign queues for nic3, nic4, and nic5. In this example, the sum of custom-assigned queues is 5+6+4 = 15.
Subtract the sum of custom-assigned queues from the number of vCPUs. If the difference is less than the number of remaining vNICs for which Google Cloud must assign queues, Google Cloud returns an error because each vNIC must have at least one queue.

Continuing the example with a VM that has 20 vCPUs and a sum of 15 custom-assigned queues, Google Cloud has 20-15 = 5 queues left to assign to the remaining vNICs (nic3, nic4, nic5).
Divide the difference from the previous step by the number of remaining vNICs and discard any remainder —⌊(number of vCPUs - sum of assigned queues)/(number of remaining vNICs)⌋. This calculation always results in a whole number (not a fraction) that is at least equal to one because of the constraint explained in the previous step. Google Cloud assigns each remaining vNIC a queue count matching the calculated number as long as the calculated number is not greater than the maximum number of queues per vNIC. The maximum number of queues per vNIC depends on the driver type:

Using virtIO or a custom driver, if the calculated number of queues for each remaining vNIC is greater than 32, Google Cloud assigns each remaining vNIC 32 queues.
Using gVNIC, if the calculated number of queues for each remaining vNIC is greater than the limit of 16 or 32 (depending on the VM configuration), Google Cloud assigns each remaining vNIC 16 queues.

Configure custom queue counts

To create a compute instance that uses a custom queue count for one or more NICs or vNICs, complete the following steps.

In the following code examples, the VM is created with the network interface type set to GVNIC and per VM Tier_1 networking performance enabled. You can use these code examples to specify the maximum queue counts and queue oversubscription that is available for the supported machine types.

gcloud

If you don't already have a VPC network with a subnet for each vNIC interface you plan to configure, create them.
Use the gcloud compute instances create command to create the compute instance. Repeat the --network-interface flag for each vNIC that you want to configure for the instance, and include the queue-count option.

    gcloud compute instances create INSTANCE_NAME \
        --zone=ZONE \
        --machine-type=MACHINE_TYPE \
        --network-performance-configs=total-egress-bandwidth-tier=TIER_1  \
        --network-interface=network=NETWORK_NAME_1,subnet=SUBNET_1,nic-type=GVNIC,queue-count=QUEUE_SIZE_1 \
        --network-interface=network=NETWORK_NAME_2,subnet=SUBNET_2,nic-type=GVNIC,queue-count=QUEUE_SIZE_2

Replace the following:

INSTANCE_NAME: a name for the new compute instance
ZONE: the zone to create the instance in
MACHINE_TYPE: the machine type of the instance. To oversubscribe the queue count, the machine type you specify must support gVNIC and Tier_1 networking.
NETWORK_NAME: the name of the network created previously
SUBNET_*: the name of one of the subnets created previously
QUEUE_SIZE: the number of queues for the vNIC, subject to the rules discussed in Custom queue allocation.

Terraform

If you don't already have a VPC network with a subnet for each vNIC interface you plan to configure, create them.

Create a compute instance with specific queue counts for vNICs using the google_compute_instance resource. Repeat the --network-interface parameter for each vNIC you want to configure for the compute instance, and include the queue-count parameter.

# Queue oversubscription instance
resource "google_compute_instance" "VM_NAME" {
project      = "PROJECT_ID"
boot_disk {
  auto_delete = true
  device_name = "DEVICE_NAME"
  initialize_params {
     image="IMAGE_NAME"
     size = DISK_SIZE
     type = "DISK_TYPE"
  }
}
machine_type = "MACHINE_TYPE"
name         = "VM_NAME"
zone = "ZONE"

network_performance_config {
    total_egress_bandwidth_tier = "TIER_1"
}

network_interface {
    nic_type = "GVNIC"
    queue_count = QUEUE_COUNT_1
    subnetwork_project = "PROJECT_ID"
    subnetwork = "SUBNET_1"
 }

network_interface {
    nic_type = "GVNIC"
    queue_count = QUEUE_COUNT_2
    subnetwork_project = "PROJECT_ID"
    subnetwork = "SUBNET_2"
}

network_interface {
    nic_type = "GVNIC"
    queue_count = QUEUE_COUNT_3
    subnetwork_project = "PROJECT_ID"
    subnetwork = "SUBNET_3""
}

network_interface {
    nic_type = "GVNIC"
    queue_count = QUEUE_COUNT_4
    subnetwork_project = "PROJECT_ID"
    subnetwork = "SUBNET_4""
}

}

Replace the following:

VM_NAME: a name for the new compute instance
PROJECT_ID: ID of the project to create the instance in. Unless you are using a Shared VPC network, the project you specify must be the same one in which all the subnets and networks were created in.
DEVICE_NAME: The name to associate with the boot disk in the guest OS
IMAGE_NAME: the name of an image,for example, "projects/debian-cloud/global/images/debian-11-bullseye-v20231010".
DISK_SIZE: the size of the boot disk, in GiB
DISK_TYPE: the type of disk to use for the boot disk, for example, pd-standard
MACHINE_TYPE: the machine type of the instance. To oversubscribe the queue count, the machine type you specify must support gVNIC and Tier_1 networking.
ZONE: the zone to create the instance in
QUEUE_COUNT: the number of queues for the vNIC, subject to the rules discussed in Custom queue allocation.
SUBNET_*: the name of the subnet that the network interface connects to

REST

If you don't already have a VPC network with a subnet for each vNIC interface you plan to configure, create them.
Create a compute instance with specific queue counts for NICs using the instances.insert method. Repeat the networkInterfaces property to configure multiple network interfaces.
```
POST https://compute.googleapis.com/compute/v1/projects/PROJECT_ID/zones/ZONE/instances
{
"name": "VM_NAME",
"machineType": "machineTypes/MACHINE_TYPE",
"networkPerformanceConfig": {
    "totalEgressBandwidthTier": TIER_1
},
"networkInterfaces": [
    {
      "nicType": gVNIC,
      "subnetwork":"regions/region/subnetworks/SUBNET_1",
      "queueCount": "QUEUE_COUNT_1"
    } ],
"networkInterfaces": [
    {
      "nicType": gVNIC,
      "subnetwork":"regions/region/subnetworks/SUBNET_2",
      "queueCount": "QUEUE_COUNT_2"
    } ],
}
```
Replace the following:
- PROJECT_ID: ID of the project to create the compute instance in
- ZONE: zone to create the compute instance in
- VM_NAME: name of the new compute instance
- MACHINE_TYPE: machine type, predefined or custom, for the new compute instance. To oversubscribe the queue count, the machine type must support gVNIC and Tier_1 networking.
- SUBNET_*: the name of the subnet that the network interface connects to
- QUEUE_COUNT: Number of queues for the vNIC, subject to the rules discussed in Custom queue allocation.

Queue allocations and changing the machine type

Compute instances are created with a default queue allocation, or you can assign a custom queue count to each virtual network interface card (vNIC) when you create a new compute instance by using the Compute Engine API. The default or custom vNIC queue assignments are only set when creating a compute instance. If your instance has vNICs that use default queue counts, you can change its machine type. If the machine type that you are changing to has a different number of vCPUs, the default queue counts for your instance are recalculated based on the new machine type.

If your VM has vNICs which use custom, non-default queue counts, then you can change the machine type by using the Google Cloud CLI or Compute Engine API to update the instance properties. The conversion succeeds if the resulting VM supports the same queue count per vNIC as the original instance. For VMs that use the VirtIO-Net interface and have a custom queue count that is higher than 16 per vNIC, you can't change the machine type to a third generation or later machine type, because they use only gVNIC. Instead, you can migrate your VM to a third generation or later machine type by following the instructions in Move your workload to a new compute instance.

What's next

Learn more about machine types.
Learn more about Virtual machine instances.
Create and start a VM instance.
Configure per VM Tier_1 networking performance for a compute instance.
Complete the quickstart tutorial Create a Linux VM instance in Compute Engine.
Complete the quickstart tutorial Create a Windows Server VM instance in Compute Engine.