Method: projects.locations.generateGroundedContent

HTTP request
Path parameters
Request body
- JSON representation
Response body
- JSON representation
Authorization scopes
GroundedGenerationContent
- JSON representation
Part
- JSON representation
GenerationSpec
- JSON representation
GroundingSpec
- JSON representation
GroundingSource
- JSON representation
InlineSource
- JSON representation
SearchSource
- JSON representation
GoogleSearchSource
- JSON representation
DynamicRetrievalConfiguration
- JSON representation
DynamicRetrievalPredictor
- JSON representation
Version
Candidate
- JSON representation
GroundingMetadata
- JSON representation
RetrievalMetadata
- JSON representation
Source
DynamicRetrievalMetadata
- JSON representation
DynamicRetrievalPredictorMetadata
- JSON representation
Version
SearchEntryPoint
- JSON representation
GroundingSupport
- JSON representation

Generates grounded content.

HTTP request

POST https://discoveryengine.googleapis.com/v1alpha/{location=projects/*/locations/*}:generateGroundedContent

The URL uses gRPC Transcoding syntax.

Path parameters

Parameters

Parameters
`location`	`string` Required. Location resource. Format: `projects/{project}/locations/{location}`.

location

string

Required. Location resource.

Format: projects/{project}/locations/{location}.

Request body

The request body contains data with the following structure:

JSON representation

JSON representation
{ "systemInstruction": { object (`GroundedGenerationContent`) }, "contents": [ { object (`GroundedGenerationContent`) } ], "generationSpec": { object (`GenerationSpec`) }, "groundingSpec": { object (`GroundingSpec`) }, "userLabels": { string: string, ... } }

{
  "systemInstruction": {
    object (GroundedGenerationContent)
  },
  "contents": [
    {
      object (GroundedGenerationContent)
    }
  ],
  "generationSpec": {
    object (GenerationSpec)
  },
  "groundingSpec": {
    object (GroundingSpec)
  },
  "userLabels": {
    string: string,
    ...
  }
}

Fields
`systemInstruction`	`object (GroundedGenerationContent)` Content of the system instruction for the current API. These instructions will take priority over any other prompt instructions if the selected model is supporting them.
`contents[]`	`object (GroundedGenerationContent)` Content of the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request.
`generationSpec`	`object (GenerationSpec)` Content generation specification.
`groundingSpec`	`object (GroundingSpec)` Grounding specification.
`userLabels`	`map (key: string, value: string)` The user labels applied to a resource must meet the following requirements: Each resource can have multiple labels, up to a maximum of 64. Each label must be a key-value pair. Keys have a minimum length of 1 character and a maximum length of 63 characters and cannot be empty. Values can be empty and have a maximum length of 63 characters. Keys and values can contain only lowercase letters, numeric characters, underscores, and dashes. All characters must use UTF-8 encoding, and international characters are allowed. The key portion of a label must be unique. However, you can use the same key with multiple resources. Keys must start with a lowercase letter or international character. See Google Cloud Document for more details.

Response body

Response for the locations.generateGroundedContent method.

If successful, the response body contains data with the following structure:

JSON representation
{ "candidates": [ { object (`Candidate`) } ] }

Fields

Fields
`candidates[]`	`object (Candidate)` Generated candidates.

candidates[]

object (Candidate)

Generated candidates.

Authorization scopes

Requires the following OAuth scope:

https://www.googleapis.com/auth/cloud-platform

For more information, see the Authentication Overview.

GroundedGenerationContent

Base structured datatype containing multi-part content of a message.

JSON representation
{ "role": string, "parts": [ { object (`Part`) } ] }

Fields

Fields
`role`	`string` producer of the content. Must be either `user` or `model`. Intended to be used for multi-turn conversations. Otherwise, it can be left unset.
`parts[]`	`object (Part)` Ordered `Parts` that constitute a single message.

role

string

producer of the content. Must be either user or model.

Intended to be used for multi-turn conversations. Otherwise, it can be left unset.

parts[]

object (Part)

Ordered Parts that constitute a single message.

Part

Single part of content.

JSON representation
{ // Union field `data` can be only one of the following: "text": string // End of list of possible types for union field `data`. }

Fields

Fields
Union field `data`. Holder of data. It only supports text for now. `data` can be only one of the following:
`text`	`string` Inline text.

Union field data. Holder of data. It only supports text for now. data can be only one of the following:

text

string

Inline text.

GenerationSpec

Content generation specification.

JSON representation
{ "modelId": string, "languageCode": string, "temperature": number, "topP": number, "topK": integer, "frequencyPenalty": number, "seed": integer, "presencePenalty": number, "maxOutputTokens": integer }

JSON representation

{
  "modelId": string,
  "languageCode": string,
  "temperature": number,
  "topP": number,
  "topK": integer,
  "frequencyPenalty": number,
  "seed": integer,
  "presencePenalty": number,
  "maxOutputTokens": integer
}

Fields
`modelId`	`string` Specifies which Vertex model id to use for generation.
`languageCode`	`string` Language code for content. Use language tags defined by BCP47.
`temperature`	`number` If specified, custom value for the temperature will be used.
`topP`	`number` If specified, custom value for nucleus sampling will be used.
`topK`	`integer` If specified, custom value for top-k sampling will be used.
`frequencyPenalty`	`number` If specified, custom value for frequency penalty will be used.
`seed`	`integer` If specified, custom value for the seed will be used.
`presencePenalty`	`number` If specified, custom value for presence penalty will be used.
`maxOutputTokens`	`integer` If specified, custom value for max output tokens will be used.

GroundingSpec

Grounding specification.

JSON representation
{ "groundingSources": [ { object (`GroundingSource`) } ] }

Fields

Fields
`groundingSources[]`	`object (GroundingSource)` Grounding sources.

groundingSources[]

object (GroundingSource)

Grounding sources.

GroundingSource

Grounding source.

JSON representation

JSON representation
{ // Union field `source` can be only one of the following: "inlineSource": { object (`InlineSource`) }, "searchSource": { object (`SearchSource`) }, "googleSearchSource": { object (`GoogleSearchSource`) } // End of list of possible types for union field `source`. }

{

  // Union field source can be only one of the following:
  "inlineSource": {
    object (InlineSource)
  },
  "searchSource": {
    object (SearchSource)
  },
  "googleSearchSource": {
    object (GoogleSearchSource)
  }
  // End of list of possible types for union field source.
}

Fields
Union field `source`. Sources. `source` can be only one of the following:
`inlineSource`	`object (InlineSource)` If set, grounding is performed with inline content.
`searchSource`	`object (SearchSource)` If set, grounding is performed with Vertex AI Search.
`googleSearchSource`	`object (GoogleSearchSource)` If set, grounding is performed with Google Search.

InlineSource

Message to be used for grounding based on inline content.

JSON representation
{ "groundingFacts": [ { object (`GroundingFact`) } ], "attributes": { string: string, ... } }

Fields

Fields
`groundingFacts[]`	`object (GroundingFact)` List of facts to be used for grounding.
`attributes`	`map (key: string, value: string)` Attributes associated with the content. Common attributes include `source` (indicating where the content was sourced from) and `author` (indicating the author of the content).

groundingFacts[]

object (GroundingFact)

List of facts to be used for grounding.

attributes

map (key: string, value: string)

Attributes associated with the content.

Common attributes include source (indicating where the content was sourced from) and author (indicating the author of the content).

SearchSource

Message to be used for grounding with Vertex AI Search.

JSON representation
{ "servingConfig": string, "maxResultCount": integer, "filter": string, "safeSearch": boolean }

Fields
`servingConfig`	`string` The resource name of the Engine to use. Format: `projects/{project}/locations/{location}/collections/{collectionId}/engines/{engineId}/servingConfigs/{servingConfigId}`
`maxResultCount`	`integer` Number of search results to return. The default value is 10. The maximumm allowed value is 10.
`filter`	`string` Filter expression to be applied to the search. The syntax is the same as `SearchRequest.filter`.
`safeSearch`	`boolean` If set, safe search is enabled in Vertex AI Search requests.

GoogleSearchSource

Google Search config parameters.

JSON representation
{ "dynamicRetrievalConfig": { object (`DynamicRetrievalConfiguration`) } }

Fields

Fields
`dynamicRetrievalConfig`	`object (DynamicRetrievalConfiguration)` Optional. Specifies the dynamic retrieval configuration for the given source.

dynamicRetrievalConfig

object (DynamicRetrievalConfiguration)

Optional. Specifies the dynamic retrieval configuration for the given source.

DynamicRetrievalConfiguration

Describes the options to customize dynamic retrieval.

JSON representation
{ "predictor": { object (`DynamicRetrievalPredictor`) } }

Fields

Fields
`predictor`	`object (DynamicRetrievalPredictor)` Specification for the predictor for dynamic retrieval.

predictor

object (DynamicRetrievalPredictor)

Specification for the predictor for dynamic retrieval.

DynamicRetrievalPredictor

Describes the predictor settings for dynamic retrieval.

JSON representation
{ "version": enum (`Version`), "threshold": number }

Fields

Fields
`version`	`enum (Version)` The version of the predictor to be used in dynamic retrieval.
`threshold`	`number` The value of the threshold. If the predictor will predict a value smaller than this, it would suppress grounding in the source.

version

enum (Version)

The version of the predictor to be used in dynamic retrieval.

threshold

number

The value of the threshold. If the predictor will predict a value smaller than this, it would suppress grounding in the source.

Version

The version of the predictor to be used in dynamic retrieval.

Enums
`VERSION_UNSPECIFIED`	Automatically choose the best version of the retrieval predictor.
`V1_INDEPENDENT`	The V1 model which is evaluating each source independently.

Candidate

A response candidate generated from the model.

JSON representation
{ "index": integer, "content": { object (`GroundedGenerationContent`) }, "groundingMetadata": { object (`GroundingMetadata`) }, "groundingScore": number }

Fields
`index`	`integer` Index of the candidate.
`content`	`object (GroundedGenerationContent)` Content of the candidate.
`groundingMetadata`	`object (GroundingMetadata)` Grounding metadata for the generated content.
`groundingScore`	`number` The overall grounding score for the candidate, in the range of [0, 1].

GroundingMetadata

Citation for the generated content.

JSON representation

JSON representation
{ "retrievalMetadata": [ { object (`RetrievalMetadata`) } ], "supportChunks": [ { object (`FactChunk`) } ], "webSearchQueries": [ string ], "searchEntryPoint": { object (`SearchEntryPoint`) }, "groundingSupport": [ { object (`GroundingSupport`) } ] }

{
  "retrievalMetadata": [
    {
      object (RetrievalMetadata)
    }
  ],
  "supportChunks": [
    {
      object (FactChunk)
    }
  ],
  "webSearchQueries": [
    string
  ],
  "searchEntryPoint": {
    object (SearchEntryPoint)
  },
  "groundingSupport": [
    {
      object (GroundingSupport)
    }
  ]
}

Fields
`retrievalMetadata[]`	`object (RetrievalMetadata)` Retrieval metadata to provide an understanding in the retrieval steps performed by the model. There can be multiple such messages which can correspond to different parts of the retrieval. This is a mechanism used to ensure transparency to our users.
`supportChunks[]`	`object (FactChunk)` List of chunks to be attributed across all claims in the candidate. These are derived from the grounding sources supplied in the request.
`webSearchQueries[]`	`string` Web search queries for the following-up web search.
`searchEntryPoint`	`object (SearchEntryPoint)` Google search entry for the following-up web searches.
`groundingSupport[]`	`object (GroundingSupport)` GroundingSupport across all claims in the answer candidate. An support to a fact indicates that the claim is supported by the fact.

RetrievalMetadata

Describes the metadata associated with a retrieval step.

JSON representation
{ "source": enum (`Source`), "dynamicRetrievalMetadata": { object (`DynamicRetrievalMetadata`) } }

Fields

source

enum (Source)

Describes the source to which the metadata is referring to.

dynamicRetrievalMetadata

object (DynamicRetrievalMetadata)

metadata for dynamic retrieval.

Source

Describes the source to which the metadata is associated to.

Enums
`SOURCE_UNSPECIFIED`	Unspecified source.
`VERTEX_AI_SEARCH`	Vertex AI search.
`GOOGLE_SEARCH`	Google Search.
`INLINE_CONTENT`	User inline provided content.
`GOOGLE_MAPS`	Google Maps.

DynamicRetrievalMetadata

Describes the metadata about dynamic retrieval.

JSON representation
{ "predictorMetadata": { object (`DynamicRetrievalPredictorMetadata`) } }

Fields

predictorMetadata

object (DynamicRetrievalPredictorMetadata)

metadata for the dynamic retrieval predictor.

DynamicRetrievalPredictorMetadata

Describes the metadata about the dynamic retrieval predictor.

JSON representation
{ "version": enum (`Version`), "prediction": number }

Fields

version

enum (Version)

The version of the predictor which was used in dynamic retrieval.

prediction

number

The value of the predictor. This should be between [0, 1] where a value of 0 means that the query would not benefit from grounding, while a value of 1.0 means that the query would benefit the most. In between values allow to differentiate between different usefulness scores for grounding.

Version

The version of the predictor which was used in dynamic retrieval.

Enums
`VERSION_UNSPECIFIED`	Unspecified version, should never be used.
`V1_INDEPENDENT`	The V1 model which is evaluating each source independently.

SearchEntryPoint

Google search entry point.

JSON representation
{ "renderedContent": string, "sdkBlob": string }

Fields

Fields
`renderedContent`	`string` Web content snippet that can be embedded in a web page or an app webview.
`sdkBlob`	`string (bytes format)` Base64 encoded JSON representing array of <search term, search url> tuple. A base64-encoded string.

renderedContent

string

Web content snippet that can be embedded in a web page or an app webview.

sdkBlob

string (bytes format)

Base64 encoded JSON representing array of <search term, search url> tuple.

A base64-encoded string.

GroundingSupport

Grounding info for a claim in the candidate and its support.

JSON representation
{ "claimText": string, "supportChunkIndices": [ integer ], "supportScore": number }

Fields

Fields
`claimText`	`string` Text for the claim in the candidate. Always provided when a support is found.
`supportChunkIndices[]`	`integer` A list of indices (into 'supportChunks') specifying the citations associated with the claim. For instance [1,3,4] means that supportChunks[1], supportChunks[3], supportChunks[4] are the chunks attributed to the claim.
`supportScore`	`number` A score in the range of [0, 1] describing how grounded is a specific claim in the support chunks indicated. Higher value means that the claim is better supported by the chunks.

claimText

string

Text for the claim in the candidate. Always provided when a support is found.

supportChunkIndices[]

integer

A list of indices (into 'supportChunks') specifying the citations associated with the claim. For instance [1,3,4] means that supportChunks[1], supportChunks[3], supportChunks[4] are the chunks attributed to the claim.

supportScore

number

A score in the range of [0, 1] describing how grounded is a specific claim in the support chunks indicated. Higher value means that the claim is better supported by the chunks.