각 Vertex AI 서비스에는 용량 계획 및 컴퓨팅 요구사항에 설명된 대로 그래픽 처리 장치 (GPU)를 비롯한 다양한 컴퓨팅 리소스가 필요합니다.
이 페이지에서는 GPU 환경에서 인공지능 (AI) 및 머신러닝 (ML) 워크로드를 실행하기 위해 GPU 리소스를 프로비저닝하는 방법을 설명합니다. 또한 이 페이지에서는 Vertex AI 기능을 구현할 수 있도록 Google Distributed Cloud (GDC) 에어 갭 어플라이언스에서 Vertex AI 사전 학습 API를 사용 설정하는 방법을 안내합니다.
GPU 리소스를 구성하고 Vertex AI 사전 학습 API를 사용 설정하거나 비활성화하는 대부분의 작업에는 관리자 액세스 권한이 필요합니다. 필요한 권한이 없는 경우 관리자에게 GPU와 Vertex AI 사전 학습된 API를 대신 사용 설정해 달라고 요청하세요.
GDC 에어갭 어플라이언스의 Vertex AI에는 선행 학습된 모델별로 하나씩 세 개의 API가 포함되어 있습니다. 이러한 사전 학습된 모델에 대해 자세히 알아보려면 다음 문서를 참고하세요.
[[["이해하기 쉬움","easyToUnderstand","thumb-up"],["문제가 해결됨","solvedMyProblem","thumb-up"],["기타","otherUp","thumb-up"]],[["이해하기 어려움","hardToUnderstand","thumb-down"],["잘못된 정보 또는 샘플 코드","incorrectInformationOrSampleCode","thumb-down"],["필요한 정보/샘플이 없음","missingTheInformationSamplesINeed","thumb-down"],["번역 문제","translationIssue","thumb-down"],["기타","otherDown","thumb-down"]],["최종 업데이트: 2025-09-04(UTC)"],[],[],null,["# Provision GPUs and enable Vertex AI pre-trained APIs\n\nEach Vertex AI service requires different computing resources,\nincluding graphics processing units (GPUs), as described in\n[Capacity planning and computing requirements](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/vertex-ai-overview#capacity).\nThis page describes how to provision GPU resources to run artificial\nintelligence (AI) and machine learning (ML) workloads in a GPU environment. This\npage also walks you through enabling the\nVertex AI pre-trained APIs on Google Distributed Cloud (GDC) air-gapped appliance so\nyou can start implementing Vertex AI capabilities.\n\nMost tasks to configure GPU resources and enable or deactivate\nVertex AI pre-trained APIs require administrator access. If you\nlack the necessary permissions, ask your administrator to enable GPUs and the\nVertex AI pre-trained APIs on your behalf.\n\nVertex AI on GDC air-gapped appliance includes three APIs, one for\neach of its pre-trained models. To learn more about these pre-trained models,\nsee the following documentation:\n\n- **Optical Character Recognition (OCR)** : [Learn about character recognition features](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/vertex-ai-ocr).\n- **Speech-to-Text** : [Learn about speech recognition features](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/vertex-ai-stt).\n- **Vertex AI Translation** : [Learn about translation features](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/vertex-ai-translation).\n\nUse the GDC console to enable, deactivate, and view the endpoints of\nthe Vertex AI pre-trained APIs.\n\nBefore you begin\n----------------\n\nTo get the permissions that you need to enable pre-trained APIs, ask your\nOrganization IAM Admin or Project IAM Admin to grant you the AI Platform Admin\n(`ai-platform-admin`) role in the project namespace.\n\nFor information about this role, see [Prepare IAM permissions](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/vertex-ai-ao-permissions).\nTo learn how to grant permissions to a subject, see [Grant and revoke access](/distributed-cloud/hosted/docs/latest/appliance/platform/pa-user/iam/set-up-role-bindings).\n\nFollow these steps to provision GPUs before enabling the pre-trained APIs:\n\n1. [Set up a project to use Vertex AI](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/vertex-ai-set-up-project).\n2. Ensure that your project has the adequate ingress communication configured.\n3. [Manage GPU workloads](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/containers/deploy-gpu-container-workloads).\n4. [Sign in to the GDC console](/distributed-cloud/hosted/docs/latest/appliance/platform/pa-user/iam/sign-in). If you can't sign in, see [Recover device if identity provider is unavailable](/distributed-cloud/hosted/docs/latest/appliance/admin/recover-idp).\n\nEnable pre-trained APIs\n-----------------------\n\nYou can enable the OCR, Speech-to-Text, and\nVertex AI Translation pre-trained APIs using the GDC console.\n| **Caution:** GDC air-gapped appliance includes a single NVIDIA A100 80GB GPU. This limits the device to running only *one* GPU-dependent Vertex AI pre-trained API at a time. If you attempt to enable both Vertex AI Translation and OCR, the second API will fail to enable with an error message indicating insufficient GPU resources. You can, however, run Speech-to-Text alongside either Vertex AI Translation or OCR, as Speech-to-Text only requires CPU resources.\n\nAfter [meeting the prerequisites](#before-you-enable-apis), follow these steps\nto enable the pre-trained APIs:\n\n1. [Sign in to the GDC console](/distributed-cloud/hosted/docs/latest/appliance/platform/pa-user/iam/sign-in).\n2. In the navigation menu, click **Vertex AI** \\\u003e **Pre-trained APIs**.\n3. On the **Pre-trained APIs** page, click **Enable** on a specific service to\n enable that API.\n\n | **Note:** The GDC console doesn't display the buttons to enable the pre-trained APIs if you don't have the correct permissions.\n4. In the confirmation dialog, click **Enable**. A progress message displays.\n\nThe enablement duration varies. It might take between 15 and 45 minutes to\nfinish.\n\nIf you want to view the status of the pre-trained APIs,\n[view the service status and endpoints](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/vertex-ai-api-status).\n| **Note:** You can't enable an API while you wait for another API to finish deactivating.\n\nDeactivate pre-trained APIs\n---------------------------\n\nYou can deactivate the OCR, Speech-to-Text, and\nVertex AI Translation pre-trained APIs using the GDC console.\n\nAfter [meeting the prerequisites](#before-you-enable-apis), follow these steps\nto deactivate the pre-trained APIs:\n\n1. [Sign in to the GDC console](/distributed-cloud/hosted/docs/latest/appliance/platform/pa-user/iam/sign-in).\n2. In the navigation menu, click **Vertex AI** \\\u003e **Pre-trained APIs**.\n3. On the **Pre-trained APIs** page, click **Disable** on a specific service to\n deactivate that API.\n\n | **Note:** The GDC console doesn't display the buttons to deactivate the pre-trained APIs if you don't have the correct permissions.\n4. In the confirmation dialog, enter `disable` in the text field to confirm\n that you want to take that action. Then, click **Disable**. A progress\n message displays.\n\nIf you want to view the status of the pre-trained APIs,\n[view the service status and endpoints](/distributed-cloud/hosted/docs/latest/appliance/application/ao-user/vertex-ai-api-status).\n| **Note:** You can't deactivate an API while you wait for another API to finish enabling."]]