This document covers the request quotas for the maximum number of requests that you can make to the OS Config API for the VM Manager suite of tools in a project. It also covers the allocation quotas for OS Config resources.
A quota limit restricts how much of a shared Google Cloud resource your Google Cloud project can use, including hardware, software, and network components. Therefore, quota limits are a part of a system that does the following:
- Monitors your use or consumption of Google Cloud products and services.
- Restricts your consumption of those resources for reasons, which include ensuring fairness and reducing spikes in usage.
- Maintains configurations that automatically enforce prescribed restrictions.
- Provides a means to request or make changes to the quota.
In most cases, when a quota limit is exceeded, the system immediately blocks access to the relevant Google resource, and the task that you're trying to perform fails. In most cases, quota limits apply to each Google Cloud project and are shared across all applications and IP addresses that use that Google Cloud project.
To increase or decrease most quota limits, use the Google Cloud console. For more information, see Request a higher quota limit.
Request quotas
When you use the Google Cloud CLI or the Google Cloud console, you are also making requests to the API and these requests count towards your request quota. If you use service accounts to access the API, that also counts towards your quota.
Quotas are enforced at intervals of every 60 seconds. That means that if you reach a specific limit anytime within 60 seconds, you need to wait for your quota bucket to refresh to make more requests.
Per user quotas
Quota group | Detail | Default quota |
---|---|---|
Read requests |
|
60 requests per user per minute |
Write requests |
|
60 requests per user per minute |
Lookup requests (Legacy beta) |
|
600 requests per user per minute |
Per region quotas
Quota group | Detail | Default quota |
---|---|---|
Read requests |
|
100 requests per region per minute |
Write requests |
|
60 requests per region per minute |
Allocation quotas
Allocation quotas are the maximum number of resources you can create of that resource type. OS Config resources include the following: patch deployments, OS policy assignments, and guest policies.
The following quotas don't reset over time and instead are released when you release the resource.
Resource | Detail | Default quota |
---|---|---|
Patch jobs |
|
100 patch jobs per project |
OS policy assignments |
|
20 OS policy assignments per zone per project |
Guest policies (Legacy beta) |
|
100 guest policies per project |
Manage request and allocation quotas
To manage the request and allocation quotas for your project, do the following:
- Follow the best practices for preserving API rate limits.
Use the Google Cloud console to view and edit request quotas:
- If you want to lower request quotas, see Capping usage.
- If you need higher request quotas than the default maximum, request a higher quota limit. In your request, add information showing the consumption rate in your environment. These include VM Manager audit logs or other error messages stating that the rate limit is exceeded. It's also useful to include information such as future growth plans, region or zone spread, and any additional requirements or dependencies.
What's next?
- Learn more about VM Manager
- Learn more about Service Quota Model.