Google models

Featured Gemini models

2.5 Pro

Our most advanced reasoning Gemini model, made to solve complex problems

Best for multimodal understanding
Capable of processing complex prompts and providing well-rounded responses
Best for coding, particularly for web development

2.5 Flash

Our best model in terms of price-performance, offering well-rounded capabilities

Support for Live API included for some endpoints
See the model's thinking process as part of the response
Balances price and performance

2.5 Flash-Lite

Our most cost effective model that supports high throughput tasks

The fastest model in the 2.5 line of models
Features a 1 million token context window and multimodal input, like 2.5 Flash
Outperforms 2.0 Flash on most evaluation benchmarks

Generally available Gemini models

diamond Gemini 2.5 Pro Our most advanced reasoning model to date

spark Gemini 2.5 Flash Our best model in terms of price-performance, offering well-rounded capabilities

photo_spark Gemini 2.5 Flash Image Our standard model upgraded for rapid creative workflows with image generation and conversational, multi-turn editing capabilities

performance_auto Gemini 2.5 Flash-Lite Our most cost effective model that supports high throughput tasks

spark Gemini 2.0 Flash Our newest multimodal model, with next generation features and improved capabilities

performance_auto Gemini 2.0 Flash-Lite A Gemini 2.0 Flash model optimized for cost efficiency and low latency

Preview Gemini models

mic_detect_auto Gemini 2.5 Flash Live API Our standard model upgraded for real-time, conversational experiences with streaming capabilities

Gemma models

Gemma 3n The latest open models, designed for efficient execution on low-resource devices, capable of multimodal input, handling text, image, video, and audio input, and generating text outputs, and trained with data in over 140 spoken languages

Gemma 3 The third of generation of our open models, featuring the ability to solve a wide variety of tasks with text and image input, support for over 140 languages, and long 128K context window

Gemma 2 The second of generation of our open models featuring text generation, summarization, and extraction

Gemma A small-sized, lightweight open model supporting text generation, summarization, and extraction

ShieldGemma 2 Instruction tuned models for evaluating the safety of text and images against a set of defined safety policies

PaliGemma Our open vision-language model that combines SigLIP and Gemma

CodeGemma Powerful, lightweight open model that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following

TxGemma Generates predictions, classifications or text based on therapeutic related data and can be used to efficiently build AI models for therapeutic-related tasks with less data and less compute

MedGemma Collection of Gemma 3 variants that are trained for performance on medical text and image comprehension

MedSigLIP SigLIP variant that is trained to encode medical images and text into a common embedding space

T5Gemma A family of lightweight yet powerful encoder-decoder research models from Google

Embeddings models

width_normal Embeddings for Text Converts text data into vector representations for semantic search, classification, clustering, and similar tasks

width_normal Multimodal Embeddings Generates vectors based on images, which can be used for downstream tasks like image classification, image search, and more

Generally available Imagen models

photo_spark Imagen 4 for Generation Use text prompts to generate novel images with higher quality than our previous image generation models

photo_spark Imagen 4 for Fast Generation Use text prompts to generate novel images with higher quality and lower latency than our previous image generation models

photo_spark Imagen 4 for Ultra Generation Use text prompts to generate novel images with higher quality and better prompt adherence than our previous image generation models

photo_spark Imagen 3 for Generation Use text prompts to generate novel images

image_edit_auto Imagen 3 for Editing and Customization Use text prompts to edit existing input images, or parts of an image with a mask or generate new images based upon the context provided by input reference images

photo_spark Imagen 3 for Fast Generation Use text prompts to generate novel images with lower latency than our other image generation models

subtitles Imagen for Captioning & VQA Use text prompts to generative novel images, edit existing ones, edit parts of an image with a mask and more

Preview Imagen models

photo_spark Virtual Try-On Generate images of people wearing clothing products.

image_edit_auto Imagen product recontext on Vertex AI Use text prompts to edit product images into different scenes or backgrounds.

Veo models

movie Veo 2 Generate Use text prompts and images to generate novel videos

movie Veo 3 Generate Use text prompts and images to generate novel videos with higher quality than our previous video generation model

movie Veo 3 Fast Use text prompts and images to generate novel videos with higher quality and lower latency than our previous video generation model

Preview Veo models

movie Veo 3 Generate preview Use text prompts and images to generate novel videos with higher quality than our previous video generation model

movie Veo 3 Fast preview Use text prompts and images to generate novel videos with higher quality and lower latency than our previous video generation model

movie Veo 3.1 Generate preview Use text prompts and images to generate novel videos with higher quality than our previous video generation model

movie Veo 3.1 Fast preview Use text prompts and images to generate novel videos with higher quality and lower latency than our previous video generation model

movie Veo 2 Preview Use text prompts and images to generate novel videos. This model version supports inpaint and outpaint.

Experimental Veo models

movie Veo 2 Experimental An experimental model, with features under test.

MedLM models

medical_information MedLM-medium HIPAA-compliant suite of medically tuned models designed to help healthcare practitioners with medical question and answer tasks, and summarization tasks for healthcare and medical documents

clinical_notes MedLM-large-large HIPAA-compliant suite of medically tuned models designed to help healthcare practitioners with medical question and answer tasks, and summarization tasks for healthcare and medical documents

Language support

Gemini

All the Gemini models can understand and respond in the following languages:

Afrikaans (af), Albanian (sq), Amharic (am), Arabic (ar), Armenian (hy), Assamese (as), Azerbaijani (az), Basque (eu), Belarusian (be), Bengali (bn), Bosnian (bs), Bulgarian (bg), Catalan (ca), Cebuano (ceb), Chinese (Simplified and Traditional) (zh), Corsican (co), Croatian (hr), Czech (cs), Danish (da), Dhivehi (dv), Dutch (nl), English (en), Esperanto (eo), Estonian (et), Filipino (Tagalog) (fil), Finnish (fi), French (fr), Frisian (fy), Galician (gl), Georgian (ka), German (de), Greek (el), Gujarati (gu), Haitian Creole (ht), Hausa (ha), Hawaiian (haw), Hebrew (iw), Hindi (hi), Hmong (hmn), Hungarian (hu), Icelandic (is), Igbo (ig), Indonesian (id), Irish (ga), Italian (it), Japanese (ja), Javanese (jv), Kannada (kn), Kazakh (kk), Khmer (km), Korean (ko), Krio (kri), Kurdish (ku), Kyrgyz (ky), Lao (lo), Latin (la), Latvian (lv), Lithuanian (lt), Luxembourgish (lb), Macedonian (mk), Malagasy (mg), Malay (ms), Malayalam (ml), Maltese (mt), Maori (mi), Marathi (mr), Meiteilon (Manipuri) (mni-Mtei), Mongolian (mn), Myanmar (Burmese) (my), Nepali (ne), Norwegian (no), Nyanja (Chichewa) (ny), Odia (Oriya) (or), Pashto (ps), Persian (fa), Polish (pl), Portuguese (pt), Punjabi (pa), Romanian (ro), Russian (ru), Samoan (sm), Scots Gaelic (gd), Serbian (sr), Sesotho (st), Shona (sn), Sindhi (sd), Sinhala (Sinhalese) (si), Slovak (sk), Slovenian (sl), Somali (so), Spanish (es), Sundanese (su), Swahili (sw), Swedish (sv), Tajik (tg), Tamil (ta), Telugu (te), Thai (th), Turkish (tr), Ukrainian (uk), Urdu (ur), Uyghur (ug), Uzbek (uz), Vietnamese (vi), Welsh (cy), Xhosa (xh), Yiddish (yi), Yoruba (yo), and Zulu (zu).

Gemma

Gemma and Gemma 2 support only the English (en) language. Gemma 3 and Gemma 3n provide multilingual support in over 140 languages.

Embeddings

Multilingual text embedding models support the following languages:

Afrikaans (af), Albanian (sq), Amharic (am), Arabic (ar), Armenian (hy), Azerbaijani (az), Basque (eu), Belarusian (be), Bengali (bn), Bulgarian (bg), Catalan (ca), Cebuano (ceb), Chinese (Simplified and Traditional) (zh), Corsican (co), Czech (cs), Danish (da), Dutch (nl), English (en), Esperanto (eo), Estonian (et), Filipino (Tagalog) (fil), Finnish (fi), French (fr), Frisian (fy), Galician (gl), Georgian (ka), German (de), Greek (el), Gujarati (gu), Haitian Creole (ht), Hausa (ha), Hawaiian (haw), Hebrew (iw), Hindi (hi), Hmong (hmn), Hungarian (hu), Icelandic (is), Igbo (ig), Indonesian (id), Irish (ga), Italian (it), Japanese (ja), Javanese (jv), Kannada (kn), Kazakh (kk), Khmer (km), Korean (ko), Kurdish (ku), Kyrgyz (ky), Lao (lo), Latin (la), Latvian (lv), Lithuanian (lt), Luxembourgish (lb), Macedonian (mk), Malagasy (mg), Malay (ms), Malayalam (ml), Maltese (mt), Maori (mi), Marathi (mr), Mongolian (mn), Myanmar (Burmese) (my), Nepali (ne), Nyanja (Chichewa) (ny), Norwegian (no), Pashto (ps), Persian (fa), Polish (pl), Portuguese (pt), Punjabi (pa), Romanian (ro), Russian (ru), Samoan (sm), Scots Gaelic (gd), Serbian (sr), Sesotho (st), Shona (sn), Sindhi (sd), Sinhala (Sinhalese) (si), Slovak (sk), Slovenian (sl), Somali (so), Spanish (es), Sundanese (su), Swahili (sw), Swedish (sv), Tajik (tg), Tamil (ta), Telugu (te), Thai (th), Turkish (tr), Ukrainian (uk), Urdu (ur), Uzbek (uz), Vietnamese (vi), Welsh (cy), Xhosa (xh), Yiddish (yi), Yoruba (yo), and Zulu (zu).

Imagen 3

Imagen 3 supports the following languages:

English (en), Chinese (Simplified and Traditional) (zh), Hindi (hi), Japanese (ja), Korean (ko), Portuguese (pt), and Spanish (es).

MedLM

The MedLM model supports the English (en) language.

Explore all models in Model Garden

Model Garden is a platform that helps you discover, test, customize, and deploy Google proprietary and select OSS models and assets. To explore the generative AI models and APIs that are available on Vertex AI, go to Model Garden in the Google Cloud console.

Go to Model Garden

To learn more about Model Garden, including available models and capabilities, see Explore AI models in Model Garden.

Model versions

To see all model versions, including legacy and retired models, see Model versions and lifecycle.

What's next

Try a quickstart tutorial using Vertex AI Studio or the Vertex AI API.
Explore pretrained models in Model Garden.
Learn how to control access to specific models in Model Garden by using a Model Garden organization policy.
Learn about pricing.