Quota is defined in terms of Cloud TPU cores. A single Cloud TPU device comprises 4 TPU chips and 8 cores: 2 cores per TPU chip. A Cloud TPU v2 Pod (beta) consists of 64 TPU devices containing 256 TPU chips (512 cores). A Cloud TPU v3 Pod (beta) consists of 256 TPU devices containing 1024 TPU chips (2048 cores).The number of cores also specifies the quota for a particular Cloud TPU. For example, a quota of 8 enables the use of 8 cores. A quota of 16 enables use of up to 16 cores, and so forth.
The notation: version-cores, for example v2-8, indicates the Cloud TPU version and the number of cores. Since the number of cores are also used to specify quota, this notation also describes a Cloud TPU quota allocation. For example, v2-32 indicates a TPU v2 type with 32 cores.
When you create a new Google Cloud Platform project, Cloud TPU allocates a default quota to the project.
Quota for single device TPU types
For single device TPU types, there are quota counts for on-demand core counts, and preemptible TPU core counts.
- On-demand TPUs: Default quota is 16 cores (2 TPU devices).
- Preemptible TPUs: Default quota is at least 48 cores (6 TPU devices).
Quota for TPU Pod types
The default quota for Cloud TPU Pods is 0. To use TPU Pod types, you must request quota using one of the following options:
- Request access to evaluation quota so that you can test the performance of TPU Pod types. TPU nodes that you create using evaluation quota are billed in one-second increments but do not guarantee the same level of service as on-demand TPU devices or devices that you create using commitment quota. Evaluation quota persists only for a limited amount of time on your project.
- Purchase a 1 year or 3 year commitment and create TPU nodes with up to 2048 cores. Commitments are not billed incrementally. Commitments bill you a monthly fee for the duration of your commitment term even if you do not use any TPU resources.
Requesting additional quota
The quota allocated for your GCP project is displayed on the Google Cloud Platform Console. If you need additional Cloud TPU quota, you can request it from the Quota page in the Google Cloud Platform Console.
Request evaluation quota for TPU Pods
The default quota for Cloud TPU Pods is 0. You must make a quota request to allocate Cloud TPU Pods for your application.
- Go to the Quotas page.
- From the Service menu, select Cloud TPU API.
- From the Metric menu, select TPU pod cores per project per region.
Alternatively, you can select TPU pod cores per project per zone.
- Select one or more regions or zones where you want to use Cloud TPU Pods.
For a complete list of TPU types available in each zone, see TPU types and zones.
- Click Edit Quotas.
- Fill out your name, email, and phone number and click Next.
- Enter your request to increase your quota and click Next.
- Submit your request.
You will receive a response from the Cloud TPU team within 1 to 2 business days of your request.