Mulai 29 April 2025, model Gemini 1.5 Pro dan Gemini 1.5 Flash tidak tersedia di project yang belum pernah menggunakan model ini, termasuk project baru. Untuk mengetahui detailnya, lihat Versi dan siklus proses model.
Tetap teratur dengan koleksi
Simpan dan kategorikan konten berdasarkan preferensi Anda.
Model OpenAI di Vertex AI menawarkan model sebagai API yang serverless dan terkelola sepenuhnya. Untuk menggunakan model OpenAI di Vertex AI, kirim
permintaan langsung ke endpoint Vertex AI API. Karena model OpenAI menggunakan API terkelola, tidak perlu menyediakan atau mengelola infrastruktur.
Anda dapat melakukan streaming respons untuk mengurangi persepsi latensi pengguna akhir. Respons
yang di-streaming menggunakan peristiwa yang dikirim server (SSE) untuk
mengalirkan respons secara bertahap.
Model OpenAI yang tersedia
Model berikut tersedia dari OpenAI untuk digunakan di
Vertex AI. Untuk mengakses model OpenAI, buka kartu modelnya di
Model Garden.
gpt-oss 120B
gpt-oss 120B OpenAI adalah model bahasa dengan bobot terbuka 120B yang dirilis berdasarkan lisensi Apache 2.0. Model ini sangat cocok untuk kasus penggunaan penalaran dan panggilan fungsi. Model
dioptimalkan untuk deployment di hardware konsumen.
Model 120B mencapai paritas yang hampir sama dengan o4-mini OpenAI pada tolok ukur penalaran inti, sambil berjalan di satu GPU 80 GB.
gpt-oss 20B OpenAI adalah model bahasa dengan bobot terbuka 20B yang dirilis berdasarkan lisensi Apache 2.0. Model ini sangat cocok untuk kasus penggunaan penalaran dan panggilan fungsi. Model
dioptimalkan untuk deployment di hardware konsumen.
Model 20B memberikan hasil yang serupa dengan o3-mini OpenAI pada tolok ukur umum dan dapat berjalan di perangkat edge dengan memori 16 GB, sehingga ideal untuk kasus penggunaan di perangkat, inferensi lokal, atau iterasi cepat tanpa infrastruktur yang mahal.
Untuk menggunakan model OpenAI dengan Vertex AI, Anda harus melakukan langkah-langkah berikut. Vertex AI API
(aiplatform.googleapis.com) harus diaktifkan untuk menggunakan
Vertex AI. Jika sudah memiliki project dengan
Vertex AI API yang diaktifkan, Anda dapat menggunakan project tersebut, bukan membuat
project baru.
Sign in to your Google Cloud account. If you're new to
Google Cloud,
create an account to evaluate how our products perform in
real-world scenarios. New customers also get $300 in free credits to
run, test, and deploy workloads.
In the Google Cloud console, on the project selector page,
select or create a Google Cloud project.
[[["Mudah dipahami","easyToUnderstand","thumb-up"],["Memecahkan masalah saya","solvedMyProblem","thumb-up"],["Lainnya","otherUp","thumb-up"]],[["Sulit dipahami","hardToUnderstand","thumb-down"],["Informasi atau kode contoh salah","incorrectInformationOrSampleCode","thumb-down"],["Informasi/contoh yang saya butuhkan tidak ada","missingTheInformationSamplesINeed","thumb-down"],["Masalah terjemahan","translationIssue","thumb-down"],["Lainnya","otherDown","thumb-down"]],["Terakhir diperbarui pada 2025-09-04 UTC."],[],[],null,["# OpenAI models\n\n| **Note:** OpenAI models are not a Google product, and its availability in Vertex AI is subject to the terms for \"Separate Offerings\" in the AI/ML Services section of the [Service Specific\n| Terms](/terms/service-terms), and separate terms found in the relevant model card.\n\nOpenAI models on Vertex AI offer fully managed and serverless\nmodels as APIs. To use an OpenAI model on Vertex AI, send\na request directly to the Vertex AI API endpoint. Because\nOpenAI models use a managed API, there's no need to provision or\nmanage infrastructure.\n\nYou can stream your responses to reduce the end-user latency perception. A\nstreamed response uses *server-sent events* (SSE) to incrementally stream the\nresponse.\n\nAvailable OpenAI models\n-----------------------\n\nThe following models are available from OpenAI to use in\nVertex AI. To access an OpenAI model, go to its\nModel Garden model card.\n\n### gpt-oss 120B\n\nOpenAI gpt-oss 120B is a 120B open-weight language model\nreleased under the Apache\n2.0 license. It is well-suited for reasoning and function calling use cases. The\nmodel is optimized for deployment on consumer hardware.\n\nThe 120B model achieves near-parity with OpenAI o4-mini on core reasoning\nbenchmarks, while running on a single 80GB GPU.\n\n[Go to the gpt-oss 120B model card](https://console.cloud.google.com/vertex-ai/publishers/openai/model-garden/gpt-oss-120b-maas)\n\n### gpt-oss 20B\n\nOpenAI gpt-oss 20B is a 20B open-weight language model\nreleased under the Apache\n2.0 license. It is well-suited for reasoning and function calling use cases. The\nmodel is optimized for deployment on consumer hardware.\n\nThe 20B model delivers similar results to OpenAI o3-mini on common benchmarks\nand can run on edge devices with 16GB of memory, making it ideal for on-device\nuse cases, local inference, or rapid iteration without costly infrastructure.\n\n[Go to the gpt-oss 20B model card](https://console.cloud.google.com/vertex-ai/publishers/openai/model-garden/gpt-oss-120b-maas)\n\n### Before you begin\n\nTo use OpenAI models with Vertex AI, you must perform the\nfollowing steps. The Vertex AI API\n(`aiplatform.googleapis.com`) must be enabled to use\nVertex AI. If you already have an existing project with the\nVertex AI API enabled, you can use that project instead of creating a\nnew project.\n\n- Sign in to your Google Cloud account. If you're new to Google Cloud, [create an account](https://console.cloud.google.com/freetrial) to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.\n- In the Google Cloud console, on the project selector page,\n select or create a Google Cloud project.\n\n [Go to project selector](https://console.cloud.google.com/projectselector2/home/dashboard)\n-\n [Verify that billing is enabled for your Google Cloud project](/billing/docs/how-to/verify-billing-enabled#confirm_billing_is_enabled_on_a_project).\n\n-\n\n\n Enable the Vertex AI API.\n\n\n [Enable the API](https://console.cloud.google.com/flows/enableapi?apiid=aiplatform.googleapis.com)\n\n- In the Google Cloud console, on the project selector page,\n select or create a Google Cloud project.\n\n [Go to project selector](https://console.cloud.google.com/projectselector2/home/dashboard)\n-\n [Verify that billing is enabled for your Google Cloud project](/billing/docs/how-to/verify-billing-enabled#confirm_billing_is_enabled_on_a_project).\n\n-\n\n\n Enable the Vertex AI API.\n\n\n [Enable the API](https://console.cloud.google.com/flows/enableapi?apiid=aiplatform.googleapis.com)\n\n1. Go to one of the following Model Garden model cards, then click **Enable**."]]