[[["容易理解","easyToUnderstand","thumb-up"],["確實解決了我的問題","solvedMyProblem","thumb-up"],["其他","otherUp","thumb-up"]],[["難以理解","hardToUnderstand","thumb-down"],["資訊或程式碼範例有誤","incorrectInformationOrSampleCode","thumb-down"],["缺少我需要的資訊/範例","missingTheInformationSamplesINeed","thumb-down"],["翻譯問題","translationIssue","thumb-down"],["其他","otherDown","thumb-down"]],["上次更新時間:2025-09-04 (世界標準時間)。"],[],[],null,["# Error code 429\n\nIf the number of your requests exceeds the capacity allocated to process\nrequests, then error code `429` is returned. The following table displays the\nerror message generated by each type of quota framework:\n\nWith a Provisioned Throughput subscription, you can reserve an\namount of throughput for specific generative AI models. If you don't have a\nProvisioned Throughput subscription and resources aren't available\nto your application, then an error code `429` is returned. Although you don't\nhave reserved capacity, you can try your request again. However, the request\nisn't counted against your error rate as described in your [service level\nagreement (SLA)](/vertex-ai/generative-ai/sla).\n\nFor projects that have purchased Provisioned Throughput,\nVertex AI measures a project's throughput and reserves the purchased\namount of throughput for the project's actual usage.\n\nFor standard Provisioned Throughput, when you use less than your\npurchased amount, errors that might otherwise be `429` are returned as `5XX` and\ncount toward the SLA error rate. For Single Zone Provisioned Throughput,\nwhen you use less than your purchased amount, capacity-related `429` errors are\ntreated as `5XX` but don't count toward the SLA error rate. When you exceed your\npurchased amount, the additional requests are processed on-demand as pay-as-you-go.\n\nPay-as-you-go\n-------------\n\nOn the pay-as-you-go quota framework, you have the following options to\nresolving `429` errors:\n\n- Use the [global endpoint](/vertex-ai/generative-ai/docs/learn/locations#global-endpoint) instead of a regional endpoint whenever possible.\n- Implement a retry strategy by using [truncated exponential backoff](/storage/docs/retry-strategy#exponential-backoff).\n- If your model uses quotas, you can submit a Quota Increase Request (QIR). If your model uses [Dynamic shared\n quota](/vertex-ai/generative-ai/docs/dynamic-shared-quota#supported_models), smoothing traffic and reducing large spikes can help. For more information, see [Dynamic shared\n quota (DSQ)](/vertex-ai/generative-ai/docs/dynamic-shared-quota).\n- Subscribe to Provisioned Throughput for a more consistent level of service. For more information, see [Provisioned Throughput](/vertex-ai/generative-ai/docs/provisioned-throughput).\n\nProvisioned Throughput\n----------------------\n\nTo correct the 429 error generated by Provisioned Throughput, do the\nfollowing:\n\n- Use the [Default behavior\n example](/vertex-ai/generative-ai/docs/use-provisioned-throughput#default), which doesn't set a header in prediction requests. Any overages are processed on-demand and billed as pay-as-you-go.\n- Increase the number of GSUs in your Provisioned Throughput subscription.\n\nWhat's next\n-----------\n\n- To learn more about dynamic shared quota, see [Dynamic shared\n quota](/vertex-ai/generative-ai/docs/dsq).\n- To learn more about Provisioned Throughput, see [Provisioned Throughput](/vertex-ai/generative-ai/docs/provisioned-throughput).\n- To learn about quotas and limits for Vertex AI, see [Vertex AI quotas and limits](/vertex-ai/docs/quotas).\n- To learn more about Google Cloud quotas and system limits, see the [Cloud Quotas documentation](/docs/quotas/overview).\n- To learn more about API errors, see [API errors](/vertex-ai/generative-ai/docs/model-reference/api-errors)."]]