Hugging Face provides pre-trained models, fine-tuning scripts, and development APIs that make the process of creating and discovering LLMs easier. Model Garden can serve Text Embeddings, Text To Image, Text Generation, and Image Text To Text models in HuggingFace. You can deploy supported Hugging Face models in Vertex AI or
Google Kubernetes Engine (GKE). The deployment option
you choose can depend on the model you're using and how much control
you want over your workloads. Vertex AI offers a managed platform for building and scaling
machine learning projects without in-house MLOps expertise. You can use
Vertex AI as the downstream application that serves the
Hugging Face models. We recommend using
Vertex AI if you want end-to-end MLOps capabilities, value-added ML
features, and a serverless experience for streamlined development. To deploy a supported Hugging Face model in Vertex AI, go
to Model Garden. Go to the Open models on Hugging Face section and click Show
more. Find and select a model to deploy. Optional: For the Deployment environment, select Vertex AI. Optional: Specify the deployment details. Click Deploy. To get started, see the following examples: Google Kubernetes Engine (GKE) is the Google Cloud solution
for managed Kubernetes that provides scalability, security, resilience, and cost
effectiveness. We recommend this option if you have existing Kubernetes
investments, your organization has in-house MLOps expertise, or if you need
granular control over complex AI/ML workloads with unique security, data
pipeline, and resource management requirements. To deploy a supported Hugging Face model in GKE, go
to Model Garden. Go to the Open models on Hugging Face section and click Show
more. Find and select a model to deploy. For the Deployment environment, select GKE. Follow the deployment instructions. To get started, see the following examples: We automatically add the latest, most popular Hugging Face models to Model Garden.
This process includes the automatic generation of a deployment configuration for
each model. To address concerns regarding vulnerabilities and malicious code, we
use the Hugging Face Malware Scanner
to assess the safety
of files within each Hugging Face model repository on a daily basis. If a
model repository is flagged as containing malware, we immediately remove the
model from the Hugging Face gallery page. While a model being designated as supported by Vertex AI signifies that it
has undergone testing and is deployable on Vertex AI, we don't guarantee
the absence of vulnerabilities or malicious code. We recommend that you conduct
your own security verifications before deploying any model in your production
environment. The default deployment configuration that is provided
with the one-click deployment option can't satisfy every requirement
given the diverse range of use cases and varying
priorities with latency, throughput, cost, and accuracy. Therefore, you can initially experiment with the one-click
deployment to establish a baseline, and then fine-tune the deployment
configurations by using the Colab notebook (vLLM,
TGI,
TEI,
HF pytorch inference)
or the Python SDK. This iterative approach lets you to tailor the
deployment to your precise needs to get the best possible performance for
your specific application. If you're looking for a specific model that's not listed in
Model Garden, the model is not supported by
Vertex AI. The following sections describe the reasoning and what you can
do. The following reasons explain why a model might not be in Model Garden: You can still work with models that available in Model Garden:Deployment options for Hugging Face models
Deploy in Vertex AI
Deploy in GKE
What does "Supported by Vertex AI" mean?
Tune deployment configurations for specific use cases
What should you do if the model you want isn't listed in Model Garden
Why isn't the model listed?
text-generation
and image-text-to-text
models.text-generation
,
text2text-generation
, text-to-image
, feature-extraction
, sentence-similarity
,
and image-text-to-text
.What are your options?
Use Hugging Face Models
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025-08-18 UTC.