About supervised tuning

Supervised fine-tuning is a good option when you have a well-defined task with available labeled data. It's particularly effective for domain-specific applications where the language or content differs significantly from the data that the large model was originally trained on.

Supervised tuning adapts model behavior with a labeled dataset. This process adjusts the model's weights to minimize the difference between its predictions and the actual labels. It can improve model performance for the following types of tasks:

  • Classification
  • Sentiment analysis
  • Entity extraction
  • Summarization of content that's not complex
  • Writing domain-specific queries
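
To make this concrete, here's a minimal sketch of assembling a labeled dataset as a JSONL file, with one labeled example per line. The chat-style messages layout, the file name, and the example contents are illustrative assumptions; check the dataset format that your target model requires.

```python
import json

# Each labeled example pairs an input prompt with the exact output the
# tuned model should learn to produce. The "messages" layout here is an
# assumed chat-style format, not a definitive schema.
examples = [
    {
        "messages": [
            {"role": "user", "content": "Classify the sentiment: 'The battery dies within an hour.'"},
            {"role": "model", "content": "negative"},
        ]
    },
    {
        "messages": [
            {"role": "user", "content": "Extract the organization: 'Acme Corp announced record earnings.'"},
            {"role": "model", "content": "Acme Corp"},
        ]
    },
]

# JSONL: one JSON object per line, a common format for tuning datasets.
with open("train.jsonl", "w") as f:
    for example in examples:
        f.write(json.dumps(example) + "\n")
```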

Supported models

The following Gemini text model supports supervised tuning:

  • gemini-1.0-pro-002

Use cases for supervised tuning on text models

Foundation text models work well when the expected output or task can be clearly and concisely defined in a prompt, and the prompt consistently produces the expected output. If you want a model to learn something niche or specific that deviates from general language patterns, consider tuning that model. For example, you can use model tuning to teach the model the following:

  • Specific structures or formats for generating output (see the sketch after this list).
  • Specific behaviors such as when to provide a terse or verbose output.
  • Specific customized outputs for specific types of inputs.
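
For instance, a hypothetical training example that teaches the model to always answer with a fixed JSON structure might look like the following sketch; the schema and field names are invented for illustration.

```python
import json

# The target response is strict JSON with a fixed schema, a structure
# that can be tedious to enforce through prompt instructions alone.
example = {
    "messages": [
        {"role": "user", "content": "Summarize this ticket: 'App crashes when I upload a photo.'"},
        {
            "role": "model",
            "content": json.dumps(
                {"category": "bug", "summary": "App crashes on photo upload", "priority": "high"}
            ),
        },
    ]
}
print(json.dumps(example))
```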

The following examples are use cases that are difficult to capture with only prompt instructions:

  • Classification: The expected response is a specific word or phrase.

    Tuning the model can help it avoid generating verbose responses.

  • Summarization: The summary follows a specific format. For example, you might need to remove personally identifiable information (PII) in a chat summary.

    This formatting requirement, replacing speaker names with #Person1 and #Person2, is difficult to describe in a prompt, and the foundation model might not naturally produce such a response.

  • Extractive question answering: The question is about a context and the answer is a substring of the context.

    The response "Last Glacial Maximum" is a specific phrase from the context.

  • Chat: You need to customize model responses to follow a persona, role, or character. The sketch after this list shows sample training examples for several of these use cases.
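
The following sketch shows what labeled training examples for the summarization, extractive question answering, and chat use cases might look like, again assuming a chat-style messages format; the contexts and target responses are illustrative.

```python
import json

examples = [
    # Summarization with PII removed: speaker names in the target summary
    # are replaced with #Person1 and #Person2.
    {
        "messages": [
            {"role": "user", "content": "Summarize this chat:\nAlice: Can you ship my order today?\nBob: Yes, it goes out this afternoon."},
            {"role": "model", "content": "#Person1 asked about shipping; #Person2 confirmed the order ships this afternoon."},
        ]
    },
    # Extractive question answering: the target answer is a substring of
    # the provided context.
    {
        "messages": [
            {"role": "user", "content": "Context: Ice sheets reached their greatest extent during the Last Glacial Maximum.\nQuestion: When did ice sheets reach their greatest extent?"},
            {"role": "model", "content": "Last Glacial Maximum"},
        ]
    },
    # Chat persona: the target response stays in character.
    {
        "messages": [
            {"role": "user", "content": "What's the weather like today?"},
            {"role": "model", "content": "Arr, the skies be clear and the winds be fair, matey!"},
        ]
    },
]

for example in examples:
    print(json.dumps(example))
```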

You can also tune a model in the following situations:

  • Prompts are not producing the expected results consistently enough.
  • The task is too complicated to define in a prompt. For example, you want the model to do behavior cloning for a behavior that's hard to articulate in a prompt.
  • You have complex intuitions about a task that are easy to elicit but difficult to formalize in a prompt.
  • You want to reduce the context length by removing few-shot examples from your prompts, as in the sketch after this list.
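
To illustrate the last point, a prompt that needs several few-shot examples before tuning can often shrink to a bare instruction afterward; the prompts below are hypothetical.

```python
# Before tuning: few-shot examples consume context length on every request.
prompt_before_tuning = (
    "Classify the sentiment as positive or negative.\n\n"
    "Review: 'Great screen, fast shipping.' -> positive\n"
    "Review: 'Stopped working after a week.' -> negative\n"
    "Review: 'The battery dies within an hour.' ->"
)

# After tuning on labeled examples, a bare instruction can suffice.
prompt_after_tuning = "Classify the sentiment: 'The battery dies within an hour.'"

print(len(prompt_before_tuning), "characters before tuning vs.",
      len(prompt_after_tuning), "after")
```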

What's next