This guide shows you how to troubleshoot The following table shows the error message for each quota framework: In the pay-as-you-go model, you use a shared pool of resources. If resources aren't available when you make a request, Vertex AI returns a To resolve If you have a Provisioned Throughput subscription, you receive a To resolve When you subscribe to Provisioned Throughput, Vertex AI reserves the purchased amount of throughput for your project. How Vertex AI handles requests varies depending on whether you use more or less than your purchased throughput:429
resource exhausted errors for different quota frameworks in Vertex AI. A 429
error indicates that the number of your requests exceeds the capacity allocated to process them.
Quota framework
Message
Pay-as-you-go
Resource exhausted, please try again later.
Provisioned Throughput
Too many requests. Exceeded the Provisioned Throughput.
Troubleshoot pay-as-you-go errors
429
error. This error doesn't count against your error rate as described in your service level agreement (SLA).429
errors, consider the following options:
Troubleshoot Provisioned Throughput errors
429
error when your requests exceed your reserved throughput and you have configured your endpoint to reject overages.429
errors, you can do one of the following:
Provisioned Throughput behavior
429
are returned as 5XX
and count toward the SLA error rate.429
errors are treated as 5XX
but don't count toward the SLA error rate.What's next
Error code 429
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-18 UTC.