[[["容易理解","easyToUnderstand","thumb-up"],["確實解決了我的問題","solvedMyProblem","thumb-up"],["其他","otherUp","thumb-up"]],[["難以理解","hardToUnderstand","thumb-down"],["資訊或程式碼範例有誤","incorrectInformationOrSampleCode","thumb-down"],["缺少我需要的資訊/範例","missingTheInformationSamplesINeed","thumb-down"],["翻譯問題","translationIssue","thumb-down"],["其他","otherDown","thumb-down"]],["上次更新時間:2025-08-25 (世界標準時間)。"],[],[],null,["This page explains what Provisioned Throughput is and when to use Provisioned Throughput.\n\nIntroduction to Provisioned Throughput\n\nProvisioned Throughput is a fixed-cost, fixed-term subscription\navailable in several term-lengths that reserves throughput for\n[supported generative AI models](/vertex-ai/generative-ai/docs/supported-models) on Vertex AI.\nTo reserve your throughput, you must specify the model and [available\nlocations](/vertex-ai/generative-ai/docs/learn/locations#available-regions) in which the model\nruns.\n\nWhen to use Provisioned Throughput\n\nIf any of the following considerations apply to your use case, consider using\nProvisioned Throughput:\n\n- You are building real-time generative AI production applications, such as chatbots and agents.\n- Your critical workloads consistently require high throughput. Throughput measurement depends on the model.\n- You want to provide a consistent and predictable experience for users of your applications.\n- You want deterministic generative AI costs by paying a fixed monthly or weekly price with control of overages.\n\nProvisioned Throughput is one of two ways to consume your\ngenerative AI models. The second way is pay-as-you-go, which is also referred to\nas [on-demand](/vertex-ai/generative-ai/docs/error-code-429#troubleshoot-dynamic-shared-quota).\n\nWhat's next\n\n- [Supported models](/vertex-ai/generative-ai/docs/supported-models) using Provisioned Throughput."]]