Vertex AI in express mode REST API reference

Vertex AI in express mode lets you try a subset of Vertex AI features by using only an express mode API key. This page shows you the REST resources available for Vertex AI in express mode.

Unlike the standard REST resource endpoints on Google Cloud, endpoints that are available when using Vertex AI in express mode use the global endpoint aiplatform.googleapis.com and don't include projects or locations. For example, the following shows the difference between standard and express mode endpoints for the datasets resource:

Standard Vertex AI endpoint format: https://{location}-aiplatform.googleapis.com/v1/projects/{project}/locations/{location}/{model}:generateContent

Endpoint format for Vertex AI in express mode: https://aiplatform.googleapis.com/v1/{model}:generateContent

REST Resource: v1.publishers.models

Methods
countTokens POST /v1/{endpoint}:countTokens
Perform a token counting.
generateContent POST /v1/{model}:generateContent
Generate content with multimodal inputs.
streamGenerateContent POST /v1/{model}:streamGenerateContent
Generate content with multimodal inputs with streaming support.

REST Resource: v1beta1.publishers.models

Methods
countTokens POST /v1beta1/{endpoint}:countTokens
Perform a token counting.
generateContent POST /v1beta1/{model}:generateContent
Generate content with multimodal inputs.
streamGenerateContent POST /v1beta1/{model}:streamGenerateContent
Generate content with multimodal inputs with streaming support.