This document lists the quotas and system limits that apply to
AI Hypercomputer.
Quotas specify the amount of a countable, shared resource that you
can use. Quotas are defined by Google Cloud services such as
AI Hypercomputer.
System limits are fixed values that cannot be changed.
Google Cloud uses quotas to help ensure fairness and reduce
spikes in resource use and availability. A quota restricts how much of a
Google Cloud resource your Google Cloud project can use. Quotas
apply to a range of resource types, including hardware, software, and network
components. For example, quotas can restrict the number of API calls to a
service, the number of load balancers used concurrently by your project, or the
number of projects that you can create. Quotas protect the community of
Google Cloud users by preventing the overloading of services. Quotas also
help you to manage your own Google Cloud resources.
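The Cloud Quotas system does the following:
- Monitors your consumption of Google Cloud products and services
- Restricts your consumption of those resources
- Provides a way to request changes to the quota value and automate quota adjustments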
In most cases, when you attempt to consume more of a resource than its quota
allows, the system blocks access to the resource, and the task that
you're trying to perform fails.
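Because a task fails when it exceeds a quota, callers often wrap requests in a retry loop. The following is a minimal sketch of exponential backoff, not an official client pattern. `QuotaExceededError` and `call_with_backoff` are hypothetical names introduced for illustration; a real client would map them to whatever error its library raises when a quota is exceeded.

```python
import random
import time


class QuotaExceededError(Exception):
    """Hypothetical error type standing in for a quota rejection."""


def call_with_backoff(fn, max_attempts=5, base_delay=1.0, max_delay=32.0,
                      sleep=time.sleep, jitter=random.random):
    """Call fn(), retrying with exponential backoff on QuotaExceededError."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except QuotaExceededError:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the failure to the caller
            # Delay doubles on each attempt, is capped at max_delay,
            # and gets up to 1 second of random jitter added.
            delay = min(base_delay * 2 ** attempt, max_delay) + jitter()
            sleep(delay)
```

Retrying is most likely to help with rate quotas, which recover over time. An allocation quota is released only when you release the resource, so waiting alone does not free it.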
Quotas generally apply at the Google Cloud project
level. Your use of a resource in one project doesn't affect
your available quota in another project. Within a Google Cloud project, quotas
are shared across all applications and IP addresses.
Allocation quotas
The following quotas don't reset over time and instead are released when you
release the resource:
| Quota | Value |
|---------------------------------------------------------|-------|
| Internal Load Balancer forwarding rules per VPC network | 30 |
Rate quotas
The following quotas apply to AI Hypercomputer requests:
| Quota | Value |
|------------------------------|------------|
| Requests per second | 10 |
| Images per feature per day | 700,000 |
| Images per feature per month | 20,000,000 |
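One way to stay under the requests-per-second quota is to pace requests on the client side. The sketch below spaces calls evenly so that successive requests start at least 1/10 of a second apart. `RequestThrottle` is a hypothetical helper, not part of any Google Cloud library, and it assumes a single process issuing all requests for the project.

```python
import time


class RequestThrottle:
    """Spaces calls so that successive requests start at least 1/rate seconds apart."""

    def __init__(self, rate=10, clock=time.monotonic, sleep=time.sleep):
        self.interval = 1.0 / rate   # 0.1 s for the 10 requests/second quota
        self.clock = clock
        self.sleep = sleep
        self.next_slot = None        # earliest time the next request may start

    def wait(self):
        """Block until the next request may start; return the seconds slept."""
        now = self.clock()
        if self.next_slot is None or now >= self.next_slot:
            self.next_slot = now + self.interval
            return 0.0
        delay = self.next_slot - now
        self.sleep(delay)
        self.next_slot += self.interval
        return delay
```

Call `throttle.wait()` immediately before each request. The rate quota also interacts with the image quotas. Assuming every request carries the maximum of 16 images for a single feature, 10 requests per second is 160 images per second, which would use up the 700,000 images-per-day quota in 4,375 seconds, or about 73 minutes. Likewise, using the full daily quota every day would reach the 20,000,000 monthly quota after roughly 28.6 days.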
System limits
AI Hypercomputer enforces the following system limits:
| System limit | Value |
|--------------------|-------|
| MB per image | 4 MB |
| MB per request | 8 MB |
| Images per request | 16 |
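Because system limits cannot be raised, it can help to check them before sending a request. The following sketch groups images into batches that respect all three limits. It rests on two assumptions that the limits above do not confirm: 1 MB is treated as 1,000,000 bytes (the more conservative reading), and only raw image bytes are counted, ignoring any encoding or metadata overhead in the request. `pack_batches` is a hypothetical helper name.

```python
MB = 1_000_000                 # assumption: decimal megabytes
MAX_IMAGE_BYTES = 4 * MB       # MB per image
MAX_REQUEST_BYTES = 8 * MB     # MB per request
MAX_IMAGES_PER_REQUEST = 16    # images per request


def pack_batches(image_sizes):
    """Group image sizes (in bytes) into batches that respect all three limits.

    Returns a list of batches; each batch is a list of indexes into image_sizes.
    Raises ValueError if any single image is above the per-image limit.
    """
    batches, current, current_bytes = [], [], 0
    for index, size in enumerate(image_sizes):
        if size > MAX_IMAGE_BYTES:
            raise ValueError(f"image {index} is {size} bytes, above the 4 MB limit")
        full = len(current) == MAX_IMAGES_PER_REQUEST
        too_big = current_bytes + size > MAX_REQUEST_BYTES
        if current and (full or too_big):
            # Close the current batch and start a new one with this image.
            batches.append(current)
            current, current_bytes = [], 0
        current.append(index)
        current_bytes += size
    if current:
        batches.append(current)
    return batches
```

The packing is greedy and preserves input order, so it does not always produce the fewest possible batches. For example, three 3 MB images yield two batches, because the third image would push the first batch past 8 MB.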
Request a quota adjustment
To adjust most quotas, use the Google Cloud console.
For more information, see
Request a quota adjustment.
Last updated 2025-09-04 UTC.