Quota Policy

Quota allocation

Quota is defined in terms of Cloud TPU cores. A single Cloud TPU device comprises 4 TPU chips and 8 cores: 2 cores per TPU chip. A Cloud TPU v2 Pod consists of 64 TPU devices containing 256 TPU chips (512 cores). A Cloud TPU v3 Pod consists of 256 TPU devices containing 1024 TPU chips (2048 cores).The number of cores also specifies the quota for a particular Cloud TPU. For example, a quota of 8 enables the use of 8 cores. A quota of 16 enables use of up to 16 cores, and so forth.

The notation: version-cores, for example v2-8, indicates the Cloud TPU version and the number of cores. Since the number of cores are also used to specify quota, this notation also describes a Cloud TPU quota allocation. For example, v2-32 indicates a TPU v2 type with 32 cores.

When you create a new Google Cloud project, Cloud TPU allocates a default quota to the project.

Quota for single device TPU types

For single device TPU types, there are quota counts for on-demand core counts, and preemptible TPU core counts.

  • On-demand TPUs: Default quota is 16 cores (2 TPU devices).
  • Preemptible TPUs: Default quota is at least 48 cores (6 TPU devices).

Quota for TPU Pod types

The default quota for Cloud TPU Pods is 0. To use TPU Pod types, you must request evaluation quota or request additional quota.

Evaluation quota

Request access to evaluation quota so that you can test the performance of TPU Pod types. TPU nodes that you create using evaluation quota are billed in one-second increments but do not guarantee the same level of service as on-demand TPU devices or devices that you create using commitment quota. Evaluation quota persists only for a limited amount of time on your project.

Requesting additional quota

The quota allocated for your Google Cloud project is displayed on the Google Cloud Console. If you need additional Cloud TPU quota, you can request it from the Quota page in the Google Cloud Console using the following procedure:

  1. Go to the Quotas page.

    Go to the Quotas page

  2. From the Service menu, select Cloud TPU API.
  3. From the Metric menu, select TPU pod cores per project per region.

    Alternatively, you can select TPU pod cores per project per zone.

  4. Select one or more regions or zones where you want to use Cloud TPU Pods.

    For a complete list of TPU types available in each zone, see TPU types and zones.

  5. Click Edit Quotas.
  6. Fill out your name, email, and phone number and click Next.
  7. Enter your request to increase your quota and click Next.
  8. Submit your request.

You will receive a response from the Cloud TPU team within 1 to 2 business days of your request.