Use Llama models

Llama is a collection of open models developed by Meta that you can fine-tune and deploy on Vertex AI. Llama offers pre-trained and instruction-tuned generative text models for assistant-like chat. You can deploy Llama 3.1, Llama 3, and Llama 2 models on Vertex AI.

Llama 3.1

Llama 3.1 collection of multilingual large language models (LLMs) is a collection of pre-trained and instruction-tuned generative models in 8B, 70B and 405B sizes (text in/text out). The Llama 3.1 instruction tuned text-only models (8B, 70B, 405B) are optimized for multilingual dialogue use cases and outperform many of the available open source and closed chat models on common industry benchmarks.

For more information see the Llama 3.1 model card in Model Garden.

Llama 3

The Llama 3 instruction-tuned models are a collection of LLMs optimized for dialogue use cases. Llama 3 models outperform many of the available open source chat models on common industry benchmarks.

For more information, see the Llama 3 model card in Model Garden.

Llama 2

The Llama 2 LLMs is a collection of pre-trained and fine-tuned generative text models, ranging in size from 7B to 70B parameters.

For more information, see the Llama 2 model card in Model Garden.

Code Llama

Meta's Code Llama models are designed for code synthesis, understanding, and instruction.

For more information, see the Code Llama model card in Model Garden.

Llama Guard 3

Llama Guard 3 builds on the capabilities of Llama Guard 2, adding three new categories: Defamation, Elections, and Code Interpreter Abuse. Additionally, this model is multilingual and has a prompt format that is consistent with Llama 3 or later instruct models.

For more information, see the Llama Guard model card in Model Garden.

Resources

For more information about Model Garden, see Explore AI models in Model Garden.