Quotas and limits

This document lists the quotas and limits that apply to fleet management.

Google Cloud uses quotas to help ensure fairness and reduce spikes in resource use and availability. A quota restricts how much of a Google Cloud resource your Google Cloud project can use. Quotas apply to a range of resource types, including hardware, software, and network components. For example, quotas can restrict the number of API calls to a service, the number of load balancers used concurrently by your project, or the number of projects that you can create. Quotas protect the community of Google Cloud users by preventing the overloading of services. Quotas also help you to manage your own Google Cloud resources.

The Cloud Quotas system does the following:

  • Monitors your consumption of Google Cloud products and services
  • Restricts your consumption of those resources
  • Provides a means to request changes to the quota value

In most cases, when you attempt to consume more of a resource than its quota allows, the system blocks access to the resource, and the task that you're trying to perform fails.

Quotas generally apply at the Google Cloud project level. Your use of a resource in one project doesn't affect your available quota in another project. Within a Google Cloud project, quotas are shared across all applications and IP addresses.

Allocation quotas

The following quotas don't reset over time and instead are released when you release the resource:

Quotas Default value
Maximum number of clusters (memberships) per fleet 50

To view your current limit and usage:

  1. Go to the IAM and Admin Quotas page for your fleet host project in the Google Cloud console. This displays all your project's quotas.
  2. Browse or filter for Service: GKE Hub.
  3. Select the Memberships quota to view more details.

Request a quota increase

To increase or decrease most quotas, use the Google Cloud console. For more information, see Request a higher quota.