Design workload separation

This document provides an overview of workload management in Google Distributed Cloud (GDC) air-gapped.

Although some workload deployment designs are recommended, you aren't required to follow them exactly as prescribed. Each GDC universe has unique requirements and considerations that must be addressed on a case-by-case basis.

This document is for IT administrators within the platform administrator group who are responsible for managing resources within their organization, and application developers within the application operator group who are responsible for developing and maintaining applications in a GDC universe.

For more information, see Audiences for GDC air-gapped documentation.

Where to deploy workloads

On the GDC platform, the operations for deploying virtual machine (VM) workloads and container workloads differ. The following diagram illustrates workload separation within the data plane layer of your organization.

Workload separation in organization data plane.

VM-based workloads operate within a VM. Conversely, container workloads operate within a Kubernetes cluster. The fundamental separation between VMs and Kubernetes clusters provides an isolation boundary between your VM workloads and container workloads. For more information, see Resource hierarchy.

The following sections introduce the differences between each workload type and their deployment lifecycle.

VM-based workloads

You can create VMs to host your VM-based workloads. You have many configuration options for your VM's shape and size to help best meet your VM-based workload requirements. You must create a VM in a project, which can have many VM workloads. VMs are a child resource of a project. For more information, see the VMs overview.

Projects containing only VM-based workloads don't require a Kubernetes cluster. Therefore, you don't need to provision Kubernetes clusters for VM-based workloads.

Container-based workloads

You can deploy container-based workloads to a pod on a Kubernetes cluster. A Kubernetes cluster consists of the following node types:

  • Control plane node: runs management services, such as the scheduler, etcd, and the API server.

  • Worker node: runs your pods and container applications.
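As a rough illustration of this division of roles, the following Python sketch models a cluster in which pods are placed only on worker nodes while control plane nodes run management services. This is a simplified model, not a GDC or Kubernetes API; all names are hypothetical.

```python
# Simplified model of the two node roles in a Kubernetes cluster.
# Pods are scheduled only onto worker nodes; control plane nodes
# are reserved for management services. Not a real scheduler.
from dataclasses import dataclass, field

@dataclass
class Node:
    name: str
    role: str                       # "control-plane" or "worker"
    pods: list = field(default_factory=list)

def schedule(pod: str, nodes: list) -> Node:
    """Place a pod on the least-loaded worker node."""
    workers = [n for n in nodes if n.role == "worker"]
    target = min(workers, key=lambda n: len(n.pods))
    target.pods.append(pod)
    return target

cluster = [
    Node("cp-0", "control-plane"),
    Node("worker-0", "worker"),
    Node("worker-1", "worker"),
]
schedule("app-pod-a", cluster)
schedule("app-pod-b", cluster)
# Both pods land on worker nodes; the control plane node holds none.
print([(n.name, n.pods) for n in cluster])
```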

Kubernetes cluster architecture

Kubernetes clusters can be attached to one or many projects, but they are not a child resource of a project. This is a fundamental difference between Kubernetes clusters and VMs: a VM is a child resource of a project, whereas a Kubernetes cluster is a child resource of the organization, which lets it attach to multiple projects.
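The hierarchy can be sketched as follows in Python. This is an illustrative data model only, not a GDC API, and the project, cluster, and VM names are hypothetical: VMs are nested under exactly one project, while clusters sit at the organization level and reference the projects they attach to.

```python
# Simplified model of the GDC resource hierarchy described above.
# VMs are child resources of a project; Kubernetes clusters are
# child resources of the organization and attach to many projects.
org = {
    "projects": {
        "web-team": {"vms": ["vm-frontend"]},
        "data-team": {"vms": ["vm-etl", "vm-db"]},
    },
    "clusters": {
        # Clusters live at the organization level, not inside a project.
        "prod-cluster": {"attached_projects": ["web-team", "data-team"]},
    },
}

def projects_for_cluster(name: str) -> list:
    """Return the projects a cluster is attached to."""
    return org["clusters"][name]["attached_projects"]

# One cluster can serve workloads from multiple projects.
print(projects_for_cluster("prod-cluster"))
```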

For pod scheduling within a Kubernetes cluster, GDC adopts the general Kubernetes concepts of scheduling, preemption, and eviction. Best practices on scheduling pods within a cluster vary based on the requirements of your workload.
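To make the preemption and eviction concepts concrete, the following sketch shows priority-based preemption in miniature: when a node is full, an incoming pod may displace a lower-priority pod. This is a toy model of the general Kubernetes behavior, not the kube-scheduler itself, and the pod names and priority values are hypothetical.

```python
# Toy model of Kubernetes priority-based preemption: when the node is
# at capacity, an incoming pod evicts the lowest-priority pod if the
# incoming pod's priority is higher. Not the real kube-scheduler.
def place(pod: dict, node: list, capacity: int = 2):
    if len(node) < capacity:
        node.append(pod)
        return None                    # scheduled without preemption
    victim = min(node, key=lambda p: p["priority"])
    if victim["priority"] < pod["priority"]:
        node.remove(victim)            # evict the lower-priority pod
        node.append(pod)
        return victim                  # the preempted pod
    return pod                         # incoming pod stays pending

node = []
place({"name": "batch", "priority": 1}, node)
place({"name": "web", "priority": 10}, node)
evicted = place({"name": "critical", "priority": 100}, node)
print(evicted["name"])  # the lowest-priority pod is preempted
```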

For more information on Kubernetes clusters, see the Kubernetes cluster overview. For more information on managing your containers in a Kubernetes cluster, see Container workloads in GDC.

Best practices for designing Kubernetes clusters

This section introduces best practices for designing Kubernetes clusters.

Consider each best practice to create a resilient cluster design for your container workload lifecycle.

Create separate clusters per software development environment

In addition to separate projects per software development environment, we recommend that you design separate Kubernetes clusters per software development environment. A software development environment is an area within your GDC universe intended for all operations that correspond to a designated lifecycle phase. For example, if you have two software development environments named development and production in your organization, you could create a separate set of Kubernetes clusters for each environment and attach projects to each cluster based on your needs. We recommend that Kubernetes clusters in pre-production and production environments have multiple projects attached to them.

Defining clusters for each software development environment assumes that workloads within a software development environment can share clusters. You then assign projects to the Kubernetes cluster of the appropriate environment. A Kubernetes cluster might be further subdivided into multiple node pools, or use taints for workload isolation.
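As a rough sketch of taint-based isolation inside a shared cluster, the following Python model checks whether a pod can land on a node: a pod is eligible only for nodes whose taints it tolerates. This is an illustrative simplification of Kubernetes taints and tolerations, not a GDC API, and the node names and taint keys are hypothetical.

```python
# Simplified model of Kubernetes taints and tolerations: a pod may
# only be scheduled onto nodes whose taints it tolerates.
def tolerates(pod_tolerations: set, node_taints: set) -> bool:
    """A pod tolerates a node if it tolerates every taint on it."""
    return node_taints <= pod_tolerations

nodes = {
    "prod-node": {"env=production"},
    "dev-node": {"env=development"},
}

# A production pod tolerates only the production taint, so it is
# isolated from the development node within the same cluster.
pod_tolerations = {"env=production"}
eligible = [name for name, taints in nodes.items()
            if tolerates(pod_tolerations, taints)]
print(eligible)  # ['prod-node']
```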

By separating Kubernetes clusters by software development environment, you isolate resource consumption, access policies, maintenance events, and cluster-level configuration changes between your production and non-production workloads.

The following diagram shows a sample Kubernetes cluster design for multiple workloads that span projects, clusters, software development environments, and machine classes.

GDC configuration

This sample architecture assumes that workloads within a production and development software development environment can share clusters. Each environment has a separate set of Kubernetes clusters, which are further subdivided into multiple node pools for different machine class requirements.

Alternatively, designing multiple Kubernetes clusters is useful for container operations like the following scenarios:

  • You have some workloads pinned to a specific Kubernetes version, so you maintain different clusters at different versions.
  • You have some workloads that require different cluster configuration needs, such as the backup policy, so you create multiple clusters with different configurations.
  • You run copies of a cluster in parallel to facilitate disruptive version upgrades or a blue-green deployment strategy.
  • You build an experimental workload that risks throttling the API server or creating other single points of failure within a cluster, so you isolate it from existing workloads.

The following diagram shows an example where multiple clusters are configured per software development environment due to requirements such as the container operations described in the preceding list.

GDC configuration

Create fewer clusters

For efficient resource utilization, we recommend designing the fewest Kubernetes clusters that meet your requirements for separating software development environments and container operations. Each additional cluster incurs overhead resource consumption, such as the additional control plane nodes it requires. Therefore, a larger cluster with many workloads uses underlying compute resources more efficiently than many small clusters.

Multiple clusters with similar configurations also create additional maintenance overhead for monitoring cluster capacity and planning cross-cluster dependencies.

If a cluster is approaching capacity, we recommend that you add nodes to that cluster instead of creating a new cluster.
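The control plane overhead can be shown with back-of-the-envelope arithmetic. The numbers below are assumptions for illustration (3 control plane nodes per cluster, 30 nodes total), not GDC sizing guidance: with a fixed node budget, every extra cluster subtracts control plane nodes that could otherwise run workloads.

```python
# Back-of-the-envelope comparison with assumed numbers (not GDC
# sizing guidance): each cluster consumes its own control plane
# nodes, so fewer clusters leave more nodes for workloads.
CONTROL_PLANE_NODES_PER_CLUSTER = 3   # assumed per-cluster overhead
TOTAL_NODES = 30                      # assumed fixed node budget

def worker_nodes(num_clusters: int) -> int:
    """Nodes left for workloads after control plane overhead."""
    return TOTAL_NODES - num_clusters * CONTROL_PLANE_NODES_PER_CLUSTER

print(worker_nodes(1))   # 27 worker nodes with one large cluster
print(worker_nodes(5))   # 15 worker nodes across five small clusters
```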

Create fewer node pools within a cluster

For efficient resource utilization, we recommend designing fewer, larger node pools within a Kubernetes cluster.

Configuring multiple node pools is useful when you need to schedule pods that require a different machine class than others. Create a node pool for each machine class that your workloads require, and enable autoscaling on node capacity to allow for efficient usage of compute resources.
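The pattern of one node pool per machine class, scaled by demand, can be sketched as follows. This is an illustrative model, not the GDC autoscaler; the machine class names and per-node pod capacities are assumptions.

```python
# Simplified model of per-machine-class node pools with demand-driven
# scale-up. Not the GDC autoscaler; capacities are assumed values.
import math

pools = {
    "standard": {"pods_per_node": 8, "nodes": 1},
    "highmem": {"pods_per_node": 4, "nodes": 1},
}

def scale_up(pool_name: str, pending_pods: int) -> int:
    """Grow a pool just enough to fit the pending pods."""
    pool = pools[pool_name]
    needed = math.ceil(pending_pods / pool["pods_per_node"])
    pool["nodes"] = max(pool["nodes"], needed)
    return pool["nodes"]

print(scale_up("standard", 20))  # 3 standard-class nodes for 20 pods
print(scale_up("highmem", 6))    # 2 high-memory nodes for 6 pods
```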

What's next