# Generative AI on Vertex AI inference API errors

This guide lists errors that you might encounter when using the
[Model API reference for Generative
AI](/vertex-ai/generative-ai/docs/model-reference/overview). The errors follow
the [error model](/apis/design/errors) of the Google Cloud API, which recommends
providing guidance on the causes and solutions specific to the
generative AI models.

API errors
----------

This table provides API error codes and descriptions.

Handle errors
-------------

Avoid spikes in traffic. Spikes are sudden, significant increases in the
number of requests within a very short period of time. Spikes can sometimes
cause issues for quota enforcement and might increase the chance of
overloading the server.

Be careful when retrying a request. We recommend retrying no more than two
times. The minimum delay is one second, with subsequent retries backing off
exponentially.

What's next
-----------

- Generative AI on Vertex AI has some limitations. To learn more, see [PaLM API limitations](/vertex-ai/generative-ai/docs/learn/responsible-ai#limitations).
- Try a quickstart tutorial using [Vertex AI Studio](/vertex-ai/generative-ai/docs/start/quickstarts/quickstart) or the [Vertex AI API](/vertex-ai/generative-ai/docs/start/quickstarts/quickstart-multimodal).
- Explore pretrained models in [Model Garden](/vertex-ai/generative-ai/docs/model-garden/explore-models).
- Learn about [quotas and limits](/vertex-ai/docs/quotas).
- Learn about [pricing](/vertex-ai/pricing#generative_ai_models).

Last updated: 2025-09-04 (UTC).
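
The retry guidance under "Handle errors" (at most two retries, a minimum delay of one second, and exponentially growing delays after that) can be sketched roughly as follows. This is a minimal illustration, not an official client pattern: `call_with_retries`, `TransientError`, and the `send_request` callable are hypothetical names introduced here, and which API errors are worth retrying is an assumption you would need to settle for your own application.

```python
import time


class TransientError(Exception):
    """Hypothetical stand-in for whatever error your client treats as retryable."""


def call_with_retries(send_request, max_retries=2, base_delay=1.0, sleep=time.sleep):
    """Call send_request(), retrying transient failures with exponential backoff.

    With the defaults, a failing call is retried at most twice, waiting
    1 second before the first retry and 2 seconds before the second.
    """
    for attempt in range(max_retries + 1):
        try:
            return send_request()
        except TransientError:
            if attempt == max_retries:
                raise  # retries exhausted: surface the error to the caller
            # Delay doubles on each retry: base_delay, 2 * base_delay, ...
            sleep(base_delay * (2 ** attempt))
```

The `sleep` parameter exists only so the waiting behavior can be swapped out in tests. In production code you might also add random jitter to each delay so that many clients do not retry at the same instant, which is in keeping with the advice to avoid traffic spikes.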