Run workloads in Autopilot mode in Standard clusters

Autopilot Standard

Cluster administrators and application operators can get the benefits of Google Kubernetes Engine (GKE) Autopilot, like pricing and pre-configured settings, in Standard mode clusters. This document shows you how to use ComputeClasses to deploy an Autopilot workload in a Standard cluster. You should already be familiar with the following concepts:

About Autopilot ComputeClasses

GKE provides Kubernetes custom resources named ComputeClasses that can be deployed in your cluster like any other Kubernetes resources. A ComputeClass defines a list of node configurations, like machine types or Spot VMs. You can select ComputeClasses in your workloads, which indicates to GKE that any new nodes should use one of the configurations in that list.

If a workload selects a ComputeClass that has the autopilot field enabled, GKE runs the Pods in Autopilot mode. The nodes that GKE creates are managed by Google and include many of the default Autopilot feature and security settings. For more information about the implications of running an Autopilot workload in your Standard clusters, including differences that you might notice when you deploy those workloads, see About Autopilot mode workloads in GKE Standard.

Types of Autopilot ComputeClasses

GKE provides built-in Autopilot ComputeClasses that you can use for most general-purpose workloads. You can also configure a new or existing custom ComputeClass to use Autopilot mode. The type of Autopilot ComputeClass that you use depends on whether your workloads need specific hardware, as follows:

General-purpose workloads: use one of the built-in Autopilot ComputeClasses, which place Pods on the container-optimized compute platform.
Workloads that require specific hardware: enable Autopilot mode for any custom ComputeClass, deploy that ComputeClass to the cluster, and select that ComputeClass in your workloads.

For more information about these options, when to use them, and the pricing for each option, see Hardware selection in Autopilot ComputeClasses.

Pricing

GKE Autopilot pricing applies to the workloads and nodes that use an Autopilot ComputeClass. The pricing model that applies depends on whether you use a built-in Autopilot ComputeClass or a custom Autopilot ComputeClass. For more information, see Pricing in "About Autopilot mode workloads in GKE Standard".

Before you begin

Before you start, make sure that you have performed the following tasks:

Enable the Google Kubernetes Engine API.

Enable Google Kubernetes Engine API

If you want to use the Google Cloud CLI for this task, install and then initialize the gcloud CLI. If you previously installed the gcloud CLI, get the latest version by running the gcloud components update command. Earlier gcloud CLI versions might not support running the commands in this document.
Note: For existing gcloud CLI installations, make sure to set the compute/region property. If you use primarily zonal clusters, set the compute/zone instead. By setting a default location, you can avoid errors in the gcloud CLI like the following: One of [--zone, --region] must be supplied: Please specify location. You might need to specify the location in certain commands if the location of your cluster differs from the default that you set.

Use a GKE Standard cluster that runs version 1.33.1-gke.1107000 or later and is enrolled in the Rapid release channel. To create a new cluster, see Creating a regional cluster.
To avoid workload rejections, learn about the requirements and security constraints of Autopilot. For more information, see predefined settings for Autopilot nodes.

Requirements

At least one node pool in the cluster must have no node taints.

This node pool is required to run GKE Standard system Pods that can't run on Autopilot nodes in Standard clusters because of the taints that GKE adds to those nodes.
Shielded GKE Nodes is required, and is enabled by default.
You must use a VPC-native cluster.
If you use Kubernetes NetworkPolicies, your cluster must use GKE Dataplane V2. By default, all new clusters use GKE Dataplane V2.

If your cluster doesn't use GKE Dataplane V2, you must disable network policy enforcement.

Limitations

Only the Rapid release channel is supported.
To update existing ComputeClasses in the cluster to use Autopilot mode, you must recreate those ComputeClasses with an updated specification. For more information, see Enable Autopilot for an existing custom ComputeClass.
You can't use the podFamily priority rule in your own ComputeClasses. This rule is available only in built-in Autopilot ComputeClasses.
The built-in Autopilot ComputeClasses don't support enabling Confidential GKE Nodes for your entire cluster. If you enable Confidential GKE Nodes for the cluster, any new Pods that select the built-in Autopilot ComputeClasses remain in the Pending state indefinitely.
Calico network policy enforcement isn't supported. You must use GKE Dataplane V2 or disable network policy enforcement.
The name of your ComputeClass can't begin with gke or autopilot, which are reserved prefixes.

Required roles and permissions

To get the permissions that you need to deploy ComputeClasses, ask your administrator to grant you the Kubernetes Engine Developer (roles/container.developer) IAM role on your cluster or project . For more information about granting roles, see Manage access to projects, folders, and organizations.

You might also be able to get the required permissions through custom roles or other predefined roles.

Modify clusters to meet Autopilot requirements

You can use the Google Cloud console to check whether your Standard cluster meets all of the requirements to run workloads in Autopilot mode. You can also use the Google Cloud console to modify the cluster to meet these requirements.

Modify an existing cluster

In the Google Cloud console, go to the Kubernetes clusters page.

Go to Kubernetes clusters
In the row for the cluster that you want to modify, click More actions > Edit. The Cluster details page opens.
In the Cluster basics section, find the Autopilot compute class compatibility section.

If this section displays Enabled, the cluster is already compatible with Autopilot. Skip to the Select an Autopilot ComputeClass in a workload section.
If the Autopilot compute class compatibility section displays Disabled, click Edit Autopilot compute class compatibility.

If this section is unavailable to edit, your cluster uses a permanent setting that's incompatible with Autopilot mode. For example, you can't modify clusters to be VPC-native after cluster creation. If you can't interact with the Autopilot compute class compatibility section, you must create a new cluster.
In the Autopilot compute class compatibility pane that opens, review the cluster settings that need to change to meet the requirements of Autopilot mode.
Click Enable Autopilot compute class. GKE modifies the cluster as needed.

Modify a new cluster

In the Google Cloud console, go to the Create a Kubernetes cluster page.

Go to Create a Kubernetes cluster
On the Cluster basics page, find the Maximize deployment options with Autopilot compute class section. This section shows you the cluster settings that need to change to meet the requirements of Autopilot mode.
Click Enable Autopilot compute class. GKE modifies the cluster as needed.
Configure other cluster settings based on your requirements. If you modify a setting that makes the cluster incompatible with Autopilot, a caution message appears.

Note: Certain cluster settings, like VPC-native traffic routing, are permanent. If you modify a permanent setting when you create your cluster, you can't update that cluster for Autopilot compatibility later.

Select an Autopilot ComputeClass in a workload

You can run a workload in Autopilot mode in your Standard cluster by selecting a ComputeClass that uses Autopilot mode. To run a workload in Autopilot mode, select one of the following options:

Console

In the Google Cloud console, go to the GKE Workloads page.

Go to Workloads
Click Deploy or Create Job. The workload creation page for a Deployment or a Job appears.
In the Nodes section, select Autopilot compute class.
In the Select compute class section, in the Compute class drop-down list, select a ComputeClass that uses Autopilot mode. This ComputeClass can be any of the following:
- One of the following built-in Autopilot ComputeClasses, which place general-purpose workloads on the Autopilot container-optimized compute platform:
  - autopilot
  - autopilot-spot
- A ComputeClass that you create, such as the n4-class ComputeClass that's described in the Configure a custom Autopilot ComputeClass section.
Configure and create the workload.

kubectl CLI

To select an Autopilot ComputeClass in a workload, use a node selector for the cloud.google.com/compute-class label. This is the same label that you use to select any other ComputeClass in GKE. The following steps show you how to create an example Deployment that selects a ComputeClass and verify that the Pods run in Autopilot mode:

Save the following example Deployment as autopilot-cc-deployment.yaml:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: helloweb
  labels:
    app: hello
spec:
  selector:
    matchLabels:
      app: hello
  template:
    metadata:
      labels:
        app: hello
    spec:
      nodeSelector:
        # Replace with the name of a compute class
        cloud.google.com/compute-class: COMPUTE_CLASS 
      containers:
      - name: hello-app
        image: us-docker.pkg.dev/google-samples/containers/gke/hello-app:1.0
        ports:
        - containerPort: 8080
        resources:
          requests:
            cpu: "250m"
            memory: "1Gi"

Replace COMPUTE_CLASS with the name of the compute class to use. This value can be any of the following:

One of the following built-in Autopilot ComputeClasses, which place general-purpose workloads on the Autopilot container-optimized compute platform:
- autopilot
- autopilot-spot
A ComputeClass that you create, such as the n4-class ComputeClass that's described in the Configure a custom Autopilot ComputeClass section.

Deploy the workload:

kubectl apply -f autopilot-cc-deployment.yaml

Configure a custom Autopilot ComputeClass

You can configure custom ComputeClasses to use Autopilot. Use a custom Autopilot ComputeClass if your workloads require specific hardware to run optimally, like GPUs or a certain Compute Engine machine series.

If your workloads don't require specific hardware, we recommend that you use one of the built-in Autopilot ComputeClasses instead. To select a built-in Autopilot ComputeClass, see the preceding Select an Autopilot ComputeClass in a workload section.

Create a new custom Autopilot ComputeClass

Save the following example ComputeClass manifest as n4-class.yaml:
```
apiVersion: cloud.google.com/v1
kind: ComputeClass
metadata:
  name: n4-class
spec:
  autopilot:
    enabled: true
  priorities:
  - machineFamily: n4
    spot: true
    minCores: 16
  - machineFamily: n4
    spot: true
  - machineFamily: n4
    spot: false
  activeMigration:
    optimizeRulePriority: true
```
This manifest includes the following fields:
- autopilot: enables Autopilot mode for the ComputeClass. If you specify this field in a ComputeClass that you deploy to an Autopilot cluster, GKE ignores the field.
- priorities: defines an array of three different N4 machine family configurations.
- activeMigration: lets GKE migrate Pods to configurations that are higher in the list of priorities when resources become available.
Deploy the ComputeClass:
```
kubectl apply -f n4-class.yaml
```

Verify that the ComputeClass exists:

kubectl get computeclasses

The output is similar to the following:

NAME                  AGE
n4-class              3s

Enable Autopilot for an existing custom ComputeClass

You can enable Autopilot in existing custom ComputeClasses that are in a Standard cluster. Enabling Autopilot in a ComputeClass that's in an Autopilot cluster has no effect, because the entire cluster uses Autopilot mode.

After you enable Autopilot for an existing ComputeClass, GKE uses Autopilot to run new Pods that select the ComputeClass. If you have existing Pods on Standard nodes that select the Autopilot ComputeClass, those Pods use Autopilot only when they're recreated.

To update an existing custom ComputeClass to use Autopilot mode, follow these steps:

In a text editor, update the manifest file for your existing ComputeClass to add the spec.autopilot field:
```
spec:
  autopilot:
    enabled: true
```
Replace the existing ComputeClass resource in the Kubernetes API with the updated specification:
```
kubectl replace --force -f PATH_TO_UPDATED_MANIFEST
```
Replace PATH_TO_UPDATED_MANIFEST with the path to your updated manifest file.
To trigger new node creation, recreate any workloads that use the compute class.

After you apply the updated manifest, any new nodes that GKE creates for this ComputeClass use Autopilot. GKE doesn't modify any existing nodes that were created prior to the update.

Verify that your workload uses Autopilot

Select one of the following options:

Console

In the Google Cloud console, go to the GKE Workloads page.

Go to Workloads
For your workload, check the value in the Node type column. If the workload uses Autopilot mode, this value is Autopilot-managed.

kubectl CLI

Check the names of the nodes that run your Pods:

kubectl get pods -l=app=hello -o wide

The output is similar to the following:

NAME                       READY   STATUS    RESTARTS   AGE     IP             NODE                                         NOMINATED NODE   READINESS GATES
helloweb-79b9f6f75-5wwc9   1/1     Running   0          152m    10.102.1.135   gk3-cluster-1-nap-10abc8ya1-f66c6cef-wg5g   <none>           <none>
helloweb-79b9f6f75-9skb9   1/1     Running   0          4d3h    10.102.0.140   gk3-cluster-1-nap-10abc8ya1-632bac02-hjl6   <none>           <none>
helloweb-79b9f6f75-h7bdv   1/1     Running   0          152m    10.102.1.137   gk3-cluster-1-nap-10abc8ya1-f66c6cef-wg5g   <none>           <none>

In this output, the gk3- prefix in the Node column indicates that the node is managed by Autopilot.

Apply an Autopilot ComputeClass by default

GKE lets you set a ComputeClass as the default for a namespace. The namespace default class applies to all Pods in that namespace that don't explicitly select a different ComputeClass. Setting an Autopilot ComputeClass as the default means that you can run all Pods in a namespace in Autopilot mode by default unless the workload selects a different option.

For more information, see Configure a default ComputeClass for a namespace.

What's next

For the parameters that you can specify in ComputeClasses, see the ComputeClass CustomResourceDefinition.