Index
EvaluationService
(interface)ExtensionExecutionService
(interface)ExtensionRegistryService
(interface)GenAiCacheConfigService
(interface)GenAiCacheService
(interface)GenAiTuningService
(interface)LlmBidiService
(interface)PredictionService
(interface)ReasoningEngineExecutionService
(interface)ReasoningEngineService
(interface)VertexRagDataService
(interface)VertexRagService
(interface)ApiAuth
(message)ApiAuth.ApiKeyConfig
(message)AugmentPromptRequest
(message)AugmentPromptRequest.Model
(message)AugmentPromptResponse
(message)AuthConfig
(message)AuthConfig.ApiKeyConfig
(message)AuthConfig.GoogleServiceAccountConfig
(message)AuthConfig.HttpBasicAuthConfig
(message)AuthConfig.OauthConfig
(message)AuthConfig.OidcConfig
(message)AuthType
(enum)BidiGenerateContentClientContent
(message)BidiGenerateContentClientMessage
(message)BidiGenerateContentRealtimeInput
(message)BidiGenerateContentServerContent
(message)BidiGenerateContentServerMessage
(message)BidiGenerateContentSetup
(message)BidiGenerateContentSetupComplete
(message)BidiGenerateContentToolCall
(message)BidiGenerateContentToolCallCancellation
(message)BidiGenerateContentToolResponse
(message)BigQueryDestination
(message)BleuInput
(message)BleuInstance
(message)BleuMetricValue
(message)BleuResults
(message)BleuSpec
(message)Blob
(message)CacheConfig
(message)CachedContent
(message)CachedContent.UsageMetadata
(message)CancelTuningJobRequest
(message)Candidate
(message)Candidate.FinishReason
(enum)ChatCompletionsRequest
(message)Citation
(message)CitationMetadata
(message)Claim
(message)CodeExecutionResult
(message)CodeExecutionResult.Outcome
(enum)CoherenceInput
(message)CoherenceInstance
(message)CoherenceResult
(message)CoherenceSpec
(message)CometInput
(message)CometInstance
(message)CometResult
(message)CometSpec
(message)CometSpec.CometVersion
(enum)Content
(message)CorpusStatus
(message)CorpusStatus.State
(enum)CorroborateContentRequest
(message)CorroborateContentRequest.Parameters
(message)CorroborateContentResponse
(message)CountTokensRequest
(message)CountTokensResponse
(message)CreateCachedContentRequest
(message)CreateRagCorpusOperationMetadata
(message)CreateRagCorpusRequest
(message)CreateReasoningEngineOperationMetadata
(message)CreateReasoningEngineRequest
(message)CreateTuningJobRequest
(message)DatasetDistribution
(message)DatasetDistribution.DistributionBucket
(message)DatasetStats
(message)DeleteCachedContentRequest
(message)DeleteExtensionRequest
(message)DeleteOperationMetadata
(message)DeleteRagCorpusRequest
(message)DeleteRagFileRequest
(message)DeleteReasoningEngineRequest
(message)DirectUploadSource
(message)DistillationDataStats
(message)DistillationHyperParameters
(message)DistillationSpec
(message)DynamicRetrievalConfig
(message)DynamicRetrievalConfig.Mode
(enum)EncryptionSpec
(message)EvaluateInstancesRequest
(message)EvaluateInstancesResponse
(message)ExactMatchInput
(message)ExactMatchInstance
(message)ExactMatchMetricValue
(message)ExactMatchResults
(message)ExactMatchSpec
(message)ExecutableCode
(message)ExecutableCode.Language
(enum)ExecuteExtensionRequest
(message)ExecuteExtensionResponse
(message)Extension
(message)ExtensionManifest
(message)ExtensionManifest.ApiSpec
(message)ExtensionOperation
(message)ExtensionPrivateServiceConnectConfig
(message)Fact
(message)FetchPredictOperationRequest
(message)FileData
(message)FileStatus
(message)FileStatus.State
(enum)FluencyInput
(message)FluencyInstance
(message)FluencyResult
(message)FluencySpec
(message)FulfillmentInput
(message)FulfillmentInstance
(message)FulfillmentResult
(message)FulfillmentSpec
(message)FunctionCall
(message)FunctionCallingConfig
(message)FunctionCallingConfig.Mode
(enum)FunctionDeclaration
(message)FunctionResponse
(message)GcsDestination
(message)GcsSource
(message)GenerateContentRequest
(message)GenerateContentResponse
(message)GenerateContentResponse.PromptFeedback
(message)GenerateContentResponse.PromptFeedback.BlockedReason
(enum)GenerateContentResponse.UsageMetadata
(message)GenerationConfig
(message)GenerationConfig.MediaResolution
(enum)GenerationConfig.Modality
(enum)GenerationConfig.RoutingConfig
(message)GenerationConfig.RoutingConfig.AutoRoutingMode
(message)GenerationConfig.RoutingConfig.AutoRoutingMode.ModelRoutingPreference
(enum)GenerationConfig.RoutingConfig.ManualRoutingMode
(message)GenericOperationMetadata
(message)GetCacheConfigRequest
(message)GetCachedContentRequest
(message)GetExtensionRequest
(message)GetRagCorpusRequest
(message)GetRagFileRequest
(message)GetReasoningEngineRequest
(message)GetTuningJobRequest
(message)GoogleDriveSource
(message)GoogleDriveSource.ResourceId
(message)GoogleDriveSource.ResourceId.ResourceType
(enum)GoogleSearchRetrieval
(message)GroundednessInput
(message)GroundednessInstance
(message)GroundednessResult
(message)GroundednessSpec
(message)GroundingChunk
(message)GroundingChunk.RetrievedContext
(message)GroundingChunk.Web
(message)GroundingMetadata
(message)GroundingSupport
(message)HarmCategory
(enum)HttpElementLocation
(enum)ImportExtensionOperationMetadata
(message)ImportExtensionRequest
(message)ImportRagFilesConfig
(message)ImportRagFilesOperationMetadata
(message)ImportRagFilesRequest
(message)ImportRagFilesResponse
(message)JiraSource
(message)JiraSource.JiraQueries
(message)JobState
(enum)ListCachedContentsRequest
(message)ListCachedContentsResponse
(message)ListExtensionsRequest
(message)ListExtensionsResponse
(message)ListRagCorporaRequest
(message)ListRagCorporaResponse
(message)ListRagFilesRequest
(message)ListRagFilesResponse
(message)ListReasoningEnginesRequest
(message)ListReasoningEnginesResponse
(message)ListTuningJobsRequest
(message)ListTuningJobsResponse
(message)LogprobsResult
(message)LogprobsResult.Candidate
(message)LogprobsResult.TopCandidates
(message)MetricxInput
(message)MetricxInstance
(message)MetricxResult
(message)MetricxSpec
(message)MetricxSpec.MetricxVersion
(enum)PairwiseChoice
(enum)PairwiseMetricInput
(message)PairwiseMetricInstance
(message)PairwiseMetricResult
(message)PairwiseMetricSpec
(message)PairwiseQuestionAnsweringQualityInput
(message)PairwiseQuestionAnsweringQualityInstance
(message)PairwiseQuestionAnsweringQualityResult
(message)PairwiseQuestionAnsweringQualitySpec
(message)PairwiseSummarizationQualityInput
(message)PairwiseSummarizationQualityInstance
(message)PairwiseSummarizationQualityResult
(message)PairwiseSummarizationQualitySpec
(message)Part
(message)PartnerModelTuningSpec
(message)PointwiseMetricInput
(message)PointwiseMetricInstance
(message)PointwiseMetricResult
(message)PointwiseMetricSpec
(message)PrebuiltVoiceConfig
(message)PredictLongRunningRequest
(message)PredictRequest
(message)PredictResponse
(message)QueryExtensionRequest
(message)QueryExtensionResponse
(message)QueryReasoningEngineRequest
(message)QueryReasoningEngineResponse
(message)QuestionAnsweringCorrectnessInput
(message)QuestionAnsweringCorrectnessInstance
(message)QuestionAnsweringCorrectnessResult
(message)QuestionAnsweringCorrectnessSpec
(message)QuestionAnsweringHelpfulnessInput
(message)QuestionAnsweringHelpfulnessInstance
(message)QuestionAnsweringHelpfulnessResult
(message)QuestionAnsweringHelpfulnessSpec
(message)QuestionAnsweringQualityInput
(message)QuestionAnsweringQualityInstance
(message)QuestionAnsweringQualityResult
(message)QuestionAnsweringQualitySpec
(message)QuestionAnsweringRelevanceInput
(message)QuestionAnsweringRelevanceInstance
(message)QuestionAnsweringRelevanceResult
(message)QuestionAnsweringRelevanceSpec
(message)RagContexts
(message)RagContexts.Context
(message)RagCorpus
(message)RagEmbeddingModelConfig
(message)RagEmbeddingModelConfig.HybridSearchConfig
(message)RagEmbeddingModelConfig.SparseEmbeddingConfig
(message)RagEmbeddingModelConfig.SparseEmbeddingConfig.Bm25
(message)RagEmbeddingModelConfig.VertexPredictionEndpoint
(message)RagFile
(message)RagFile.RagFileType
(enum)RagFileChunkingConfig
(message)RagFileChunkingConfig.FixedLengthChunking
(message)RagFileParsingConfig
(message)RagFileParsingConfig.AdvancedParser
(message)RagFileParsingConfig.LayoutParser
(message)RagFileParsingConfig.LlmParser
(message)RagFileTransformationConfig
(message)RagQuery
(message)RagQuery.Ranking
(message)RagRetrievalConfig
(message)RagRetrievalConfig.Filter
(message)RagRetrievalConfig.HybridSearch
(message)RagRetrievalConfig.Ranking
(message)RagRetrievalConfig.Ranking.LlmRanker
(message)RagRetrievalConfig.Ranking.RankService
(message)RagVectorDbConfig
(message)RagVectorDbConfig.Pinecone
(message)RagVectorDbConfig.RagManagedDb
(message)RagVectorDbConfig.VertexFeatureStore
(message)RagVectorDbConfig.VertexVectorSearch
(message)RagVectorDbConfig.Weaviate
(message)ReasoningEngine
(message)ReasoningEngineSpec
(message)ReasoningEngineSpec.PackageSpec
(message)RebaseTunedModelOperationMetadata
(message)RebaseTunedModelRequest
(message)Retrieval
(message)RetrievalMetadata
(message)RetrieveContextsRequest
(message)RetrieveContextsRequest.VertexRagStore
(message)RetrieveContextsRequest.VertexRagStore.RagResource
(message)RetrieveContextsResponse
(message)RougeInput
(message)RougeInstance
(message)RougeMetricValue
(message)RougeResults
(message)RougeSpec
(message)RuntimeConfig
(message)RuntimeConfig.CodeInterpreterRuntimeConfig
(message)RuntimeConfig.VertexAISearchRuntimeConfig
(message)SafetyInput
(message)SafetyInstance
(message)SafetyRating
(message)SafetyRating.HarmProbability
(enum)SafetyRating.HarmSeverity
(enum)SafetyResult
(message)SafetySetting
(message)SafetySetting.HarmBlockMethod
(enum)SafetySetting.HarmBlockThreshold
(enum)SafetySpec
(message)Schema
(message)SearchEntryPoint
(message)Segment
(message)SharePointSources
(message)SharePointSources.SharePointSource
(message)SlackSource
(message)SlackSource.SlackChannels
(message)SlackSource.SlackChannels.SlackChannel
(message)SpeechConfig
(message)StreamDirectPredictRequest
(message)StreamDirectPredictResponse
(message)StreamDirectRawPredictRequest
(message)StreamDirectRawPredictResponse
(message)StreamQueryReasoningEngineRequest
(message)StreamingPredictRequest
(message)StreamingPredictResponse
(message)StreamingRawPredictRequest
(message)StreamingRawPredictResponse
(message)SummarizationHelpfulnessInput
(message)SummarizationHelpfulnessInstance
(message)SummarizationHelpfulnessResult
(message)SummarizationHelpfulnessSpec
(message)SummarizationQualityInput
(message)SummarizationQualityInstance
(message)SummarizationQualityResult
(message)SummarizationQualitySpec
(message)SummarizationVerbosityInput
(message)SummarizationVerbosityInstance
(message)SummarizationVerbosityResult
(message)SummarizationVerbositySpec
(message)SupervisedHyperParameters
(message)SupervisedHyperParameters.AdapterSize
(enum)SupervisedTuningDataStats
(message)SupervisedTuningDatasetDistribution
(message)SupervisedTuningDatasetDistribution.DatasetBucket
(message)SupervisedTuningSpec
(message)Tensor
(message)Tensor.DataType
(enum)Tool
(message)Tool.CodeExecution
(message)Tool.GoogleSearch
(message)ToolCall
(message)ToolCallValidInput
(message)ToolCallValidInstance
(message)ToolCallValidMetricValue
(message)ToolCallValidResults
(message)ToolCallValidSpec
(message)ToolConfig
(message)ToolNameMatchInput
(message)ToolNameMatchInstance
(message)ToolNameMatchMetricValue
(message)ToolNameMatchResults
(message)ToolNameMatchSpec
(message)ToolParameterKVMatchInput
(message)ToolParameterKVMatchInstance
(message)ToolParameterKVMatchMetricValue
(message)ToolParameterKVMatchResults
(message)ToolParameterKVMatchSpec
(message)ToolParameterKeyMatchInput
(message)ToolParameterKeyMatchInstance
(message)ToolParameterKeyMatchMetricValue
(message)ToolParameterKeyMatchResults
(message)ToolParameterKeyMatchSpec
(message)ToolUseExample
(message)ToolUseExample.ExtensionOperation
(message)Trajectory
(message)TrajectoryAnyOrderMatchInput
(message)TrajectoryAnyOrderMatchInstance
(message)TrajectoryAnyOrderMatchMetricValue
(message)TrajectoryAnyOrderMatchResults
(message)TrajectoryAnyOrderMatchSpec
(message)TrajectoryExactMatchInput
(message)TrajectoryExactMatchInstance
(message)TrajectoryExactMatchMetricValue
(message)TrajectoryExactMatchResults
(message)TrajectoryExactMatchSpec
(message)TrajectoryInOrderMatchInput
(message)TrajectoryInOrderMatchInstance
(message)TrajectoryInOrderMatchMetricValue
(message)TrajectoryInOrderMatchResults
(message)TrajectoryInOrderMatchSpec
(message)TrajectoryPrecisionInput
(message)TrajectoryPrecisionInstance
(message)TrajectoryPrecisionMetricValue
(message)TrajectoryPrecisionResults
(message)TrajectoryPrecisionSpec
(message)TrajectoryRecallInput
(message)TrajectoryRecallInstance
(message)TrajectoryRecallMetricValue
(message)TrajectoryRecallResults
(message)TrajectoryRecallSpec
(message)TrajectorySingleToolUseInput
(message)TrajectorySingleToolUseInstance
(message)TrajectorySingleToolUseMetricValue
(message)TrajectorySingleToolUseResults
(message)TrajectorySingleToolUseSpec
(message)TunedModel
(message)TunedModelRef
(message)TuningDataStats
(message)TuningJob
(message)Type
(enum)UpdateCacheConfigRequest
(message)UpdateCachedContentRequest
(message)UpdateExtensionRequest
(message)UpdateRagCorpusOperationMetadata
(message)UpdateRagCorpusRequest
(message)UpdateReasoningEngineOperationMetadata
(message)UpdateReasoningEngineRequest
(message)UploadRagFileConfig
(message)VertexAISearch
(message)VertexAiSearchConfig
(message)VertexRagStore
(message)VertexRagStore.RagResource
(message)VideoMetadata
(message)VoiceConfig
(message)
EvaluationService
Vertex AI Online Evaluation Service.
EvaluateInstances |
---|
Evaluates instances based on a given metric.
|
ExtensionExecutionService
A service for Extension execution.
ExecuteExtension |
---|
Executes the request against a given extension.
|
QueryExtension |
---|
Queries an extension with a default controller.
|
ExtensionRegistryService
A service for managing Vertex AI's Extension registry.
DeleteExtension |
---|
Deletes an Extension.
|
GetExtension |
---|
Gets an Extension.
|
ImportExtension |
---|
Imports an Extension.
|
ListExtensions |
---|
Lists Extensions in a location.
|
UpdateExtension |
---|
Updates an Extension.
|
GenAiCacheConfigService
Service for GenAI Cache Config.
GetCacheConfig |
---|
Gets a GenAI cache config.
|
UpdateCacheConfig |
---|
Updates a cache config.
|
GenAiCacheService
Service for managing Vertex AI's CachedContent resource.
CreateCachedContent |
---|
Creates cached content, this call will initialize the cached content in the data storage, and users need to pay for the cache data storage.
|
DeleteCachedContent |
---|
Deletes cached content
|
GetCachedContent |
---|
Gets cached content configurations
|
ListCachedContents |
---|
Lists cached contents in a project
|
UpdateCachedContent |
---|
Updates cached content configurations
|
GenAiTuningService
A service for creating and managing GenAI Tuning Jobs.
CancelTuningJob |
---|
Cancels a TuningJob. Starts asynchronous cancellation on the TuningJob. The server makes a best effort to cancel the job, but success is not guaranteed. Clients can use
|
CreateTuningJob |
---|
Creates a TuningJob. A created TuningJob right away will be attempted to be run.
|
GetTuningJob |
---|
Gets a TuningJob.
|
ListTuningJobs |
---|
Lists TuningJobs in a Location.
|
RebaseTunedModel |
---|
Rebase a TunedModel.
|
LlmBidiService
A service for LLM related bidirectional low latency APIs.
BidiGenerateContent |
---|
Bidirectional streaming predict.
|
PredictionService
A service for online predictions and explanations.
ChatCompletions |
---|
Exposes an OpenAI-compatible endpoint for chat completions.
|
CountTokens |
---|
Perform a token counting.
|
FetchPredictOperation |
---|
Fetch an asynchronous online prediction operation.
|
GenerateContent |
---|
Generate content with multimodal inputs.
|
Predict |
---|
Perform an online prediction.
|
PredictLongRunning |
---|
|
ServerStreamingPredict |
---|
Perform a server-side streaming online prediction request for Vertex LLM streaming.
|
StreamDirectPredict |
---|
Perform a streaming online prediction request to a gRPC model server for Vertex first-party products and frameworks.
|
StreamDirectRawPredict |
---|
Perform a streaming online prediction request to a gRPC model server for custom containers.
|
StreamGenerateContent |
---|
Generate content with multimodal inputs with streaming support.
|
StreamingPredict |
---|
Perform a streaming online prediction request for Vertex first-party products and frameworks.
|
StreamingRawPredict |
---|
Perform a streaming online prediction request through gRPC.
|
ReasoningEngineExecutionService
A service for executing queries on Reasoning Engine.
QueryReasoningEngine |
---|
Queries using a reasoning engine.
|
StreamQueryReasoningEngine |
---|
Streams queries using a reasoning engine.
|
ReasoningEngineService
A service for managing Vertex AI's Reasoning Engines.
CreateReasoningEngine |
---|
Creates a reasoning engine.
|
DeleteReasoningEngine |
---|
Deletes a reasoning engine.
|
GetReasoningEngine |
---|
Gets a reasoning engine.
|
ListReasoningEngines |
---|
Lists reasoning engines in a location.
|
UpdateReasoningEngine |
---|
Updates a reasoning engine.
|
VertexRagDataService
A service for managing user data for RAG.
CreateRagCorpus |
---|
Creates a RagCorpus.
|
DeleteRagCorpus |
---|
Deletes a RagCorpus.
|
DeleteRagFile |
---|
Deletes a RagFile.
|
GetRagCorpus |
---|
Gets a RagCorpus.
|
GetRagFile |
---|
Gets a RagFile.
|
ImportRagFiles |
---|
Import files from Google Cloud Storage or Google Drive into a RagCorpus.
|
ListRagCorpora |
---|
Lists RagCorpora in a Location.
|
ListRagFiles |
---|
Lists RagFiles in a RagCorpus.
|
UpdateRagCorpus |
---|
Updates a RagCorpus.
|
VertexRagService
A service for retrieving relevant contexts.
AugmentPrompt |
---|
Given an input prompt, it returns augmented prompt from vertex rag store to guide LLM towards generating grounded responses.
|
CorroborateContent |
---|
Given an input text, it returns a score that evaluates the factuality of the text. It also extracts and returns claims from the text and provides supporting facts.
|
RetrieveContexts |
---|
Retrieves relevant contexts for a query.
|
ApiAuth
The generic reusable api auth config.
Fields | |
---|---|
Union field auth_config . The auth config. auth_config can be only one of the following: |
|
api_ |
The API secret. |
ApiKeyConfig
The API secret.
Fields | |
---|---|
api_ |
Required. The SecretManager secret version resource name storing API key. e.g. projects/{project}/secrets/{secret}/versions/{version} |
AugmentPromptRequest
Request message for AugmentPrompt.
Fields | |
---|---|
parent |
Required. The resource name of the Location from which to augment prompt. The users must have permission to make a call in the project. Format: |
contents[] |
Optional. Input content to augment, only text format is supported for now. |
model |
Optional. Metadata of the backend deployed model. |
Union field data_source . The data source for retrieving contexts. data_source can be only one of the following: |
|
vertex_ |
Optional. Retrieves contexts from the Vertex RagStore. |
Model
Metadata of the backend deployed model.
Fields | |
---|---|
model |
Optional. The model that the user will send the augmented prompt for content generation. |
model_ |
Optional. The model version of the backend deployed model. |
AugmentPromptResponse
Response message for AugmentPrompt.
Fields | |
---|---|
augmented_ |
Augmented prompt, only text format is supported for now. |
facts[] |
Retrieved facts from RAG data sources. |
AuthConfig
Auth configuration to run the extension.
Fields | |
---|---|
auth_ |
Type of auth scheme. |
Union field
|
|
api_ |
Config for API key auth. |
http_ |
Config for HTTP Basic auth. |
google_ |
Config for Google Service Account auth. |
oauth_ |
Config for user oauth. |
oidc_ |
Config for user OIDC auth. |
ApiKeyConfig
Config for authentication with API key.
Fields | |
---|---|
name |
Required. The parameter name of the API key. E.g. If the API request is "https://example.com/act?api_key= |
api_ |
Optional. The name of the SecretManager secret version resource storing the API key. Format:
|
api_ |
Optional. The API key to be used in the request directly. |
http_ |
Required. The location of the API key. |
GoogleServiceAccountConfig
Config for Google Service Account Authentication.
Fields | |
---|---|
service_ |
Optional. The service account that the extension execution service runs as.
|
HttpBasicAuthConfig
Config for HTTP Basic Authentication.
Fields | |
---|---|
credential_ |
Required. The name of the SecretManager secret version resource storing the base64 encoded credentials. Format:
|
OauthConfig
Config for user oauth.
Fields | |
---|---|
Union field
|
|
access_ |
Access token for extension endpoint. Only used to propagate token from [[ExecuteExtensionRequest.runtime_auth_config]] at request time. |
service_ |
The service account used to generate access tokens for executing the Extension.
|
OidcConfig
Config for user OIDC auth.
Fields | |
---|---|
Union field
|
|
id_ |
OpenID Connect formatted ID token for extension endpoint. Only used to propagate token from [[ExecuteExtensionRequest.runtime_auth_config]] at request time. |
service_ |
The service account used to generate an OpenID Connect (OIDC)-compatible JWT token signed by the Google OIDC Provider (accounts.google.com) for extension endpoint (https://cloud.google.com/iam/docs/create-short-lived-credentials-direct#sa-credentials-oidc).
|
AuthType
Type of Auth.
Enums | |
---|---|
AUTH_TYPE_UNSPECIFIED |
|
NO_AUTH |
No Auth. |
API_KEY_AUTH |
API Key Auth. |
HTTP_BASIC_AUTH |
HTTP Basic Auth. |
GOOGLE_SERVICE_ACCOUNT_AUTH |
Google Service Account Auth. |
OAUTH |
OAuth auth. |
OIDC_AUTH |
OpenID Connect (OIDC) Auth. |
BidiGenerateContentClientContent
Incremental update of the current conversation delivered from the client. All the content here will unconditionally be appended to the conversation history and used as part of the prompt to the model to generate content.
A message here will interrupt any current model generation.
Fields | |
---|---|
turns[] |
Optional. The content appended to the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request. |
turn_ |
Optional. If true, indicates that the server content generation should start with the currently accumulated prompt. Otherwise, the server will await additional messages before starting generation. |
BidiGenerateContentClientMessage
Messages sent by the client in the BidiGenerateContent RPC call.
Fields | |
---|---|
Union field message_type . The type of the message. message_type can be only one of the following: |
|
setup |
Optional. Message to be sent in the first and only first client message. |
client_ |
Optional. Incremental update of the current conversation delivered from the client. |
realtime_ |
Optional. User input that is sent in real time. |
tool_ |
Optional. Response to a |
BidiGenerateContentRealtimeInput
User input that is sent in real time.
This is different from ClientContentUpdate
in a few ways: - Can be sent continuously without interruption the model generation. - If there is a need to mix data interleaved across the ClientContentUpdate
and the RealtimeUpdate
, server will attempt to optimize for best response, but there are no guarnatees. - End of turn is not explicitly specified, but is rather derived from user activity, e.g. end of speech. - Even before the end of turn, the data will be processed incrementally to optimize for a fast start of the response from the model. - Is always assumed to be the user's input (cannot be used to populate conversation history).
Fields | |
---|---|
media_ |
Optional. Inlined bytes data for media input. |
BidiGenerateContentServerContent
Incremental server update generated by the model in response to client messages.
Content is generated as quickly as possible, and not in realtime. Clients may choose to buffer and play it out in realtime.
Fields | |
---|---|
turn_ |
Output only. If true, indicates that the model is done generating. Generation will only start in response to additional client messages. Can be set alongside |
interrupted |
Output only. If true, indicates that a client message has interrupted current model generation. If the client is playing out the content in realtime, this is a good signal to stop and empty the current queue. If the client is playing out the content in realtime, this is a good signal to stop and empty the current playback queue. |
grounding_ |
Output only. Metadata specifies sources used to ground generated content. |
model_ |
Output only. The content that the model has generated as part of the current conversation with the user. |
BidiGenerateContentServerMessage
Response message for BidiGenerateContent RPC call.
Fields | |
---|---|
Union field message_type . The type of the message. message_type can be only one of the following: |
|
setup_ |
Output only. Sent in response to a |
server_ |
Output only. Content generated by the model in response to client messages. |
tool_ |
Output only. Request for the client to execute the |
tool_ |
Output only. Notification for the client that a previously issued |
BidiGenerateContentSetup
Message to be sent in the first and only first BidiGenerateContentClientMessage
. Contains configuration that will apply for the duration of the streaming RPC.
Clients should wait for a BidiGenerateContentSetupComplete
message before sending any additional messages.
Fields | |
---|---|
model |
Required. The fully qualified name of the publisher model or tuned model endpoint to use. Publisher model format: |
generation_ |
Optional. Generation config. The following fields are not supported: - response_logprobs - response_mime_type - logprobs - response_schema - stop_sequence - routing_config - audio_timestamp |
system_ |
Optional. The user provided system instructions for the model. Note: only text should be used in parts and content in each part will be in a separate paragraph. |
tools[] |
Optional. A list of A |
BidiGenerateContentSetupComplete
This type has no fields.
Sent in response to a BidiGenerateContentSetup
message from the client.
BidiGenerateContentToolCall
Request for the client to execute the function_calls
and return the responses with the matching id
s.
Fields | |
---|---|
function_ |
Output only. The function call to be executed. |
BidiGenerateContentToolCallCancellation
Notification for the client that a previously issued ToolCallMessage
with the specified id
s should have been not executed and should be cancelled. If there were side-effects to those tool calls, clients may attempt to undo the tool calls. This message occurs only in cases where the clients interrupt server turns.
Fields | |
---|---|
ids[] |
Output only. The ids of the tool calls to be cancelled. |
BidiGenerateContentToolResponse
Client generated response to a ToolCall
received from the server. Individual FunctionResponse
objects are matched to the responective FunctionCall
objects by the id
field.
Note that in the unary and server-streaming GenerateContent APIs function calling happens by exchanging the Content
parts, while in the bidi GenerateContent APIs function calling happens over these dedicated set of messages.
Fields | |
---|---|
function_ |
Optional. The response to the function calls. |
BigQueryDestination
The BigQuery location for the output content.
Fields | |
---|---|
output_ |
Required. BigQuery URI to a project or table, up to 2000 characters long. When only the project is specified, the Dataset and Table is created. When the full table reference is specified, the Dataset must exist and table must not exist. Accepted forms:
|
BleuInput
Input for bleu metric.
Fields | |
---|---|
metric_ |
Required. Spec for bleu score metric. |
instances[] |
Required. Repeated bleu instances. |
BleuInstance
Spec for bleu instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Required. Ground truth used to compare against the prediction. |
BleuMetricValue
Bleu metric value for an instance.
Fields | |
---|---|
score |
Output only. Bleu score. |
BleuResults
Results for bleu metric.
Fields | |
---|---|
bleu_ |
Output only. Bleu metric values. |
BleuSpec
Spec for bleu score metric - calculates the precision of n-grams in the prediction as compared to reference - returns a score ranging between 0 to 1.
Fields | |
---|---|
use_ |
Optional. Whether to use_effective_order to compute bleu score. |
Blob
Content blob.
Fields | |
---|---|
mime_ |
Required. The IANA standard MIME type of the source data. |
data |
Required. Raw bytes. |
CacheConfig
Config of GenAI caching features. This is a singleton resource.
Fields | |
---|---|
name |
Identifier. Name of the cache config. Format: - |
disable_ |
If set to true, disables GenAI caching. Otherwise caching is enabled. |
CachedContent
A resource used in LLM queries for users to explicitly specify what to cache and how to cache.
Fields | |
---|---|
name |
Immutable. Identifier. The server-generated resource name of the cached content Format: projects/{project}/locations/{location}/cachedContents/{cached_content} |
display_ |
Optional. Immutable. The user-generated meaningful display name of the cached content. |
model |
Immutable. The name of the publisher model to use for cached content. Format: projects/{project}/locations/{location}/publishers/{publisher}/models/{model} |
system_ |
Optional. Input only. Immutable. Developer set system instruction. Currently, text only |
contents[] |
Optional. Input only. Immutable. The content to cache |
tools[] |
Optional. Input only. Immutable. A list of |
tool_ |
Optional. Input only. Immutable. Tool config. This config is shared for all tools |
create_ |
Output only. Creatation time of the cache entry. |
update_ |
Output only. When the cache entry was last updated in UTC time. |
usage_ |
Output only. Metadata on the usage of the cached content. |
Union field expiration . Expiration time of the cached content. expiration can be only one of the following: |
|
expire_ |
Timestamp of when this resource is considered expired. This is always provided on output, regardless of what was sent on input. |
ttl |
Input only. The TTL for this resource. The expiration time is computed: now + TTL. |
UsageMetadata
Metadata on the usage of the cached content.
Fields | |
---|---|
total_ |
Total number of tokens that the cached content consumes. |
text_ |
Number of text characters. |
image_ |
Number of images. |
video_ |
Duration of video in seconds. |
audio_ |
Duration of audio in seconds. |
CancelTuningJobRequest
Request message for GenAiTuningService.CancelTuningJob
.
Fields | |
---|---|
name |
Required. The name of the TuningJob to cancel. Format: |
Candidate
A response candidate generated from the model.
Fields | |
---|---|
index |
Output only. Index of the candidate. |
content |
Output only. Content parts of the candidate. |
avg_ |
Output only. Average log probability score of the candidate. |
logprobs_ |
Output only. Log-likelihood scores for the response tokens and top tokens |
finish_ |
Output only. The reason why the model stopped generating tokens. If empty, the model has not stopped generating the tokens. |
safety_ |
Output only. List of ratings for the safety of a response candidate. There is at most one rating per category. |
citation_ |
Output only. Source attribution of the generated content. |
grounding_ |
Output only. Metadata specifies sources used to ground generated content. |
finish_ |
Output only. Describes the reason the mode stopped generating tokens in more detail. This is only filled when |
FinishReason
The reason why the model stopped generating tokens. If empty, the model has not stopped generating the tokens.
Enums | |
---|---|
FINISH_REASON_UNSPECIFIED |
The finish reason is unspecified. |
STOP |
Token generation reached a natural stopping point or a configured stop sequence. |
MAX_TOKENS |
Token generation reached the configured maximum output tokens. |
SAFETY |
Token generation stopped because the content potentially contains safety violations. NOTE: When streaming, content is empty if content filters blocks the output. |
RECITATION |
The token generation stopped because of potential recitation. |
OTHER |
All other reasons that stopped the token generation. |
BLOCKLIST |
Token generation stopped because the content contains forbidden terms. |
PROHIBITED_CONTENT |
Token generation stopped for potentially containing prohibited content. |
SPII |
Token generation stopped because the content potentially contains Sensitive Personally Identifiable Information (SPII). |
MALFORMED_FUNCTION_CALL |
The function call generated by the model is invalid. |
ChatCompletionsRequest
Request message for [PredictionService.ChatCompletions]
Fields | |
---|---|
endpoint |
Required. The name of the endpoint requested to serve the prediction. Format: |
http_ |
Optional. The prediction input. Supports HTTP headers and arbitrary data payload. |
Citation
Source attributions for content.
Fields | |
---|---|
start_ |
Output only. Start index into the content. |
end_ |
Output only. End index into the content. |
uri |
Output only. Url reference of the attribution. |
title |
Output only. Title of the attribution. |
license |
Output only. License of the attribution. |
publication_ |
Output only. Publication date of the attribution. |
CitationMetadata
A collection of source attributions for a piece of content.
Fields | |
---|---|
citations[] |
Output only. List of citations. |
Claim
Claim that is extracted from the input text and facts that support it.
Fields | |
---|---|
fact_ |
Indexes of the facts supporting this claim. |
start_ |
Index in the input text where the claim starts (inclusive). |
end_ |
Index in the input text where the claim ends (exclusive). |
score |
Confidence score of this corroboration. |
CodeExecutionResult
Result of executing the [ExecutableCode].
Always follows a part
containing the [ExecutableCode].
Fields | |
---|---|
outcome |
Required. Outcome of the code execution. |
output |
Optional. Contains stdout when code execution is successful, stderr or other description otherwise. |
Outcome
Enumeration of possible outcomes of the code execution.
Enums | |
---|---|
OUTCOME_UNSPECIFIED |
Unspecified status. This value should not be used. |
OUTCOME_OK |
Code execution completed successfully. |
OUTCOME_FAILED |
Code execution finished but with a failure. stderr should contain the reason. |
OUTCOME_DEADLINE_EXCEEDED |
Code execution ran for too long, and was cancelled. There may or may not be a partial output present. |
CoherenceInput
Input for coherence metric.
Fields | |
---|---|
metric_ |
Required. Spec for coherence score metric. |
instance |
Required. Coherence instance. |
CoherenceInstance
Spec for coherence instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
CoherenceResult
Spec for coherence result.
Fields | |
---|---|
explanation |
Output only. Explanation for coherence score. |
score |
Output only. Coherence score. |
confidence |
Output only. Confidence for coherence score. |
CoherenceSpec
Spec for coherence score metric.
Fields | |
---|---|
version |
Optional. Which version to use for evaluation. |
CometInput
Input for Comet metric.
Fields | |
---|---|
metric_ |
Required. Spec for comet metric. |
instance |
Required. Comet instance. |
CometInstance
Spec for Comet instance - The fields used for evaluation are dependent on the comet version.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
source |
Optional. Source text in original language. |
CometResult
Spec for Comet result - calculates the comet score for the given instance using the version specified in the spec.
Fields | |
---|---|
score |
Output only. Comet score. Range depends on version. |
CometSpec
Spec for Comet metric.
Fields | |
---|---|
source_ |
Optional. Source language in BCP-47 format. |
target_ |
Optional. Target language in BCP-47 format. Covers both prediction and reference. |
version |
Required. Which version to use for evaluation. |
CometVersion
Comet version options.
Enums | |
---|---|
COMET_VERSION_UNSPECIFIED |
Comet version unspecified. |
COMET_22_SRC_REF |
Comet 22 for translation + source + reference (source-reference-combined). |
Content
The base structured datatype containing multi-part content of a message.
A Content
includes a role
field designating the producer of the Content
and a parts
field containing multi-part data that contains the content of the message turn.
Fields | |
---|---|
role |
Optional. The producer of the content. Must be either 'user' or 'model'. Useful to set for multi-turn conversations, otherwise can be left blank or unset. |
parts[] |
Required. Ordered |
CorpusStatus
RagCorpus status.
Fields | |
---|---|
state |
Output only. RagCorpus life state. |
error_ |
Output only. Only when the |
State
RagCorpus life state.
Enums | |
---|---|
UNKNOWN |
This state is not supposed to happen. |
INITIALIZED |
RagCorpus resource entry is initialized, but hasn't done validation. |
ACTIVE |
RagCorpus is provisioned successfully and is ready to serve. |
ERROR |
RagCorpus is in a problematic situation. See error_message field for details. |
CorroborateContentRequest
Request message for CorroborateContent.
Fields | |
---|---|
parent |
Required. The resource name of the Location from which to corroborate text. The users must have permission to make a call in the project. Format: |
facts[] |
Optional. Facts used to generate the text can also be used to corroborate the text. |
parameters |
Optional. Parameters that can be set to override default settings per request. |
content |
Optional. Input content to corroborate, only text format is supported for now. |
Parameters
Parameters that can be overrided per request.
Fields | |
---|---|
citation_ |
Optional. Only return claims with citation score larger than the threshold. |
CorroborateContentResponse
Response message for CorroborateContent.
Fields | |
---|---|
claims[] |
Claims that are extracted from the input content and facts that support the claims. |
corroboration_ |
Confidence score of corroborating content. Value is [0,1] with 1 is the most confidence. |
CountTokensRequest
Request message for PredictionService.CountTokens
.
Fields | |
---|---|
endpoint |
Required. The name of the Endpoint requested to perform token counting. Format: |
model |
Optional. The name of the publisher model requested to serve the prediction. Format: |
instances[] |
Optional. The instances that are the input to token counting call. Schema is identical to the prediction schema of the underlying model. |
contents[] |
Optional. Input content. |
tools[] |
Optional. A list of A |
system_ |
Optional. The user provided system instructions for the model. Note: only text should be used in parts and content in each part will be in a separate paragraph. |
generation_ |
Optional. Generation config that the model will use to generate the response. |
CountTokensResponse
Response message for PredictionService.CountTokens
.
Fields | |
---|---|
total_ |
The total number of tokens counted across all instances from the request. |
total_ |
The total number of billable characters counted across all instances from the request. |
CreateCachedContentRequest
Request message for GenAiCacheService.CreateCachedContent
.
Fields | |
---|---|
parent |
Required. The parent resource where the cached content will be created |
cached_ |
Required. The cached content to create |
CreateRagCorpusOperationMetadata
Runtime operation information for VertexRagDataService.CreateRagCorpus
.
Fields | |
---|---|
generic_ |
The operation generic information. |
CreateRagCorpusRequest
Request message for VertexRagDataService.CreateRagCorpus
.
Fields | |
---|---|
parent |
Required. The resource name of the Location to create the RagCorpus in. Format: |
rag_ |
Required. The RagCorpus to create. |
CreateReasoningEngineOperationMetadata
Details of ReasoningEngineService.CreateReasoningEngine
operation.
Fields | |
---|---|
generic_ |
The common part of the operation metadata. |
CreateReasoningEngineRequest
Request message for ReasoningEngineService.CreateReasoningEngine
.
Fields | |
---|---|
parent |
Required. The resource name of the Location to create the ReasoningEngine in. Format: |
reasoning_ |
Required. The ReasoningEngine to create. |
CreateTuningJobRequest
Request message for GenAiTuningService.CreateTuningJob
.
Fields | |
---|---|
parent |
Required. The resource name of the Location to create the TuningJob in. Format: |
tuning_ |
Required. The TuningJob to create. |
DatasetDistribution
Distribution computed over a tuning dataset.
Fields | |
---|---|
sum |
Output only. Sum of a given population of values. |
min |
Output only. The minimum of the population values. |
max |
Output only. The maximum of the population values. |
mean |
Output only. The arithmetic mean of the values in the population. |
median |
Output only. The median of the values in the population. |
p5 |
Output only. The 5th percentile of the values in the population. |
p95 |
Output only. The 95th percentile of the values in the population. |
buckets[] |
Output only. Defines the histogram bucket. |
DistributionBucket
Dataset bucket used to create a histogram for the distribution given a population of values.
Fields | |
---|---|
count |
Output only. Number of values in the bucket. |
left |
Output only. Left bound of the bucket. |
right |
Output only. Right bound of the bucket. |
DatasetStats
Statistics computed over a tuning dataset.
Fields | |
---|---|
tuning_ |
Output only. Number of examples in the tuning dataset. |
total_ |
Output only. Number of tuning characters in the tuning dataset. |
total_ |
Output only. Number of billable characters in the tuning dataset. |
tuning_ |
Output only. Number of tuning steps for this Tuning Job. |
user_ |
Output only. Dataset distributions for the user input tokens. |
user_ |
Output only. Dataset distributions for the messages per example. |
user_ |
Output only. Sample user messages in the training dataset uri. |
user_ |
Output only. Dataset distributions for the user output tokens. |
DeleteCachedContentRequest
Request message for GenAiCacheService.DeleteCachedContent
.
Fields | |
---|---|
name |
Required. The resource name referring to the cached content |
DeleteExtensionRequest
Request message for ExtensionRegistryService.DeleteExtension
.
Fields | |
---|---|
name |
Required. The name of the Extension resource to be deleted. Format: |
DeleteOperationMetadata
Details of operations that perform deletes of any entities.
Fields | |
---|---|
generic_ |
The common part of the operation metadata. |
DeleteRagCorpusRequest
Request message for VertexRagDataService.DeleteRagCorpus
.
Fields | |
---|---|
name |
Required. The name of the RagCorpus resource to be deleted. Format: |
force |
Optional. If set to true, any RagFiles in this RagCorpus will also be deleted. Otherwise, the request will only work if the RagCorpus has no RagFiles. |
DeleteRagFileRequest
Request message for VertexRagDataService.DeleteRagFile
.
Fields | |
---|---|
name |
Required. The name of the RagFile resource to be deleted. Format: |
DeleteReasoningEngineRequest
Request message for ReasoningEngineService.DeleteReasoningEngine
.
Fields | |
---|---|
name |
Required. The name of the ReasoningEngine resource to be deleted. Format: |
DirectUploadSource
This type has no fields.
The input content is encapsulated and uploaded in the request.
DistillationDataStats
Statistics computed for datasets used for distillation.
Fields | |
---|---|
training_ |
Output only. Statistics computed for the training dataset. |
DistillationHyperParameters
Hyperparameters for Distillation.
Fields | |
---|---|
adapter_ |
Optional. Adapter size for distillation. |
epoch_ |
Optional. Number of complete passes the model makes over the entire training dataset during training. |
learning_ |
Optional. Multiplier for adjusting the default learning rate. |
DistillationSpec
Tuning Spec for Distillation.
Fields | |
---|---|
training_dataset_uri |
Deprecated. Cloud Storage path to file containing training dataset for tuning. The dataset must be formatted as a JSONL file. |
hyper_ |
Optional. Hyperparameters for Distillation. |
student_model |
The student model that is being tuned, e.g., "google/gemma-2b-1.1-it". Deprecated. Use base_model instead. |
pipeline_root_directory |
Deprecated. A path in a Cloud Storage bucket, which will be treated as the root output directory of the distillation pipeline. It is used by the system to generate the paths of output artifacts. |
Union field teacher_model . The teacher model that is being distilled from, e.g., "gemini-1.5-pro-002". teacher_model can be only one of the following: |
|
base_ |
The base teacher model that is being distilled, e.g., "gemini-1.0-pro-002". |
tuned_ |
The resource name of the Tuned teacher model. Format: |
validation_ |
Optional. Cloud Storage path to file containing validation dataset for tuning. The dataset must be formatted as a JSONL file. |
DynamicRetrievalConfig
Describes the options to customize dynamic retrieval.
Fields | |
---|---|
mode |
The mode of the predictor to be used in dynamic retrieval. |
dynamic_ |
Optional. The threshold to be used in dynamic retrieval. If not set, a system default value is used. |
Mode
The mode of the predictor to be used in dynamic retrieval.
Enums | |
---|---|
MODE_UNSPECIFIED |
Always trigger retrieval. |
MODE_DYNAMIC |
Run retrieval only when system decides it is necessary. |
EncryptionSpec
Represents a customer-managed encryption key spec that can be applied to a top-level resource.
Fields | |
---|---|
kms_ |
Required. The Cloud KMS resource identifier of the customer managed encryption key used to protect a resource. Has the form: |
EvaluateInstancesRequest
Request message for EvaluationService.EvaluateInstances.
Fields | |
---|---|
location |
Required. The resource name of the Location to evaluate the instances. Format: |
Union field metric_inputs . Instances and specs for evaluation metric_inputs can be only one of the following: |
|
exact_ |
Auto metric instances. Instances and metric spec for exact match metric. |
bleu_ |
Instances and metric spec for bleu metric. |
rouge_ |
Instances and metric spec for rouge metric. |
fluency_ |
LLM-based metric instance. General text generation metrics, applicable to other categories. Input for fluency metric. |
coherence_ |
Input for coherence metric. |
safety_ |
Input for safety metric. |
groundedness_ |
Input for groundedness metric. |
fulfillment_ |
Input for fulfillment metric. |
summarization_ |
Input for summarization quality metric. |
pairwise_ |
Input for pairwise summarization quality metric. |
summarization_ |
Input for summarization helpfulness metric. |
summarization_ |
Input for summarization verbosity metric. |
question_ |
Input for question answering quality metric. |
pairwise_ |
Input for pairwise question answering quality metric. |
question_ |
Input for question answering relevance metric. |
question_ |
Input for question answering helpfulness metric. |
question_ |
Input for question answering correctness metric. |
pointwise_ |
Input for pointwise metric. |
pairwise_ |
Input for pairwise metric. |
tool_ |
Tool call metric instances. Input for tool call valid metric. |
tool_ |
Input for tool name match metric. |
tool_ |
Input for tool parameter key match metric. |
tool_ |
Input for tool parameter key value match metric. |
comet_ |
Translation metrics. Input for Comet metric. |
metricx_ |
Input for Metricx metric. |
trajectory_ |
Input for trajectory exact match metric. |
trajectory_ |
Input for trajectory in order match metric. |
trajectory_ |
Input for trajectory match any order metric. |
trajectory_ |
Input for trajectory precision metric. |
trajectory_ |
Input for trajectory recall metric. |
trajectory_ |
Input for trajectory single tool use metric. |
EvaluateInstancesResponse
Response message for EvaluationService.EvaluateInstances.
Fields | |
---|---|
Union field evaluation_results . Evaluation results will be served in the same order as presented in EvaluationRequest.instances. evaluation_results can be only one of the following: |
|
exact_ |
Auto metric evaluation results. Results for exact match metric. |
bleu_ |
Results for bleu metric. |
rouge_ |
Results for rouge metric. |
fluency_ |
LLM-based metric evaluation result. General text generation metrics, applicable to other categories. Result for fluency metric. |
coherence_ |
Result for coherence metric. |
safety_ |
Result for safety metric. |
groundedness_ |
Result for groundedness metric. |
fulfillment_ |
Result for fulfillment metric. |
summarization_ |
Summarization only metrics. Result for summarization quality metric. |
pairwise_ |
Result for pairwise summarization quality metric. |
summarization_ |
Result for summarization helpfulness metric. |
summarization_ |
Result for summarization verbosity metric. |
question_ |
Question answering only metrics. Result for question answering quality metric. |
pairwise_ |
Result for pairwise question answering quality metric. |
question_ |
Result for question answering relevance metric. |
question_ |
Result for question answering helpfulness metric. |
question_ |
Result for question answering correctness metric. |
pointwise_ |
Generic metrics. Result for pointwise metric. |
pairwise_ |
Result for pairwise metric. |
tool_ |
Tool call metrics. Results for tool call valid metric. |
tool_ |
Results for tool name match metric. |
tool_ |
Results for tool parameter key match metric. |
tool_ |
Results for tool parameter key value match metric. |
comet_ |
Translation metrics. Result for Comet metric. |
metricx_ |
Result for Metricx metric. |
trajectory_ |
Result for trajectory exact match metric. |
trajectory_ |
Result for trajectory in order match metric. |
trajectory_ |
Result for trajectory any order match metric. |
trajectory_ |
Result for trajectory precision metric. |
trajectory_ |
Results for trajectory recall metric. |
trajectory_ |
Results for trajectory single tool use metric. |
ExactMatchInput
Input for exact match metric.
Fields | |
---|---|
metric_ |
Required. Spec for exact match metric. |
instances[] |
Required. Repeated exact match instances. |
ExactMatchInstance
Spec for exact match instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Required. Ground truth used to compare against the prediction. |
ExactMatchMetricValue
Exact match metric value for an instance.
Fields | |
---|---|
score |
Output only. Exact match score. |
ExactMatchResults
Results for exact match metric.
Fields | |
---|---|
exact_ |
Output only. Exact match metric values. |
ExactMatchSpec
This type has no fields.
Spec for exact match metric - returns 1 if prediction and reference exactly matches, otherwise 0.
ExecutableCode
Code generated by the model that is meant to be executed, and the result returned to the model.
Generated when using the [FunctionDeclaration] tool and [FunctionCallingConfig] mode is set to [Mode.CODE].
Fields | |
---|---|
language |
Required. Programming language of the |
code |
Required. The code to be executed. |
Language
Supported programming languages for the generated code.
Enums | |
---|---|
LANGUAGE_UNSPECIFIED |
Unspecified language. This value should not be used. |
PYTHON |
Python >= 3.10, with numpy and simpy available. |
ExecuteExtensionRequest
Request message for ExtensionExecutionService.ExecuteExtension
.
Fields | |
---|---|
name |
Required. Name (identifier) of the extension; Format: |
operation_ |
Required. The desired ID of the operation to be executed in this extension as defined in |
operation_ |
Optional. Request parameters that will be used for executing this operation. The struct should be in a form of map with param name as the key and actual param value as the value. E.g. If this operation requires a param "name" to be set to "abc". you can set this to something like {"name": "abc"}. |
runtime_ |
Optional. Auth config provided at runtime to override the default value in [Extension.manifest.auth_config][]. The AuthConfig.auth_type should match the value in [Extension.manifest.auth_config][]. |
ExecuteExtensionResponse
Response message for ExtensionExecutionService.ExecuteExtension
.
Fields | |
---|---|
content |
Response content from the extension. The content should be conformant to the response.content schema in the extension's manifest/OpenAPI spec. |
Extension
Extensions are tools for large language models to access external data, run computations, etc.
Fields | |
---|---|
name |
Identifier. The resource name of the Extension. |
display_ |
Required. The display name of the Extension. The name can be up to 128 characters long and can consist of any UTF-8 characters. |
description |
Optional. The description of the Extension. |
create_ |
Output only. Timestamp when this Extension was created. |
update_ |
Output only. Timestamp when this Extension was most recently updated. |
etag |
Optional. Used to perform consistent read-modify-write updates. If not set, a blind "overwrite" update happens. |
manifest |
Required. Manifest of the Extension. |
extension_ |
Output only. Supported operations. |
runtime_ |
Optional. Runtime config controlling the runtime behavior of this Extension. |
tool_ |
Optional. Examples to illustrate the usage of the extension as a tool. |
private_ |
Optional. The PrivateServiceConnect config for the extension. If specified, the service endpoints associated with the Extension should be registered with private network access in the provided Service Directory. If the service contains more than one endpoint with a network, the service will arbitrarilty choose one of the endpoints to use for extension execution. |
satisfies_ |
Output only. Reserved for future use. |
satisfies_ |
Output only. Reserved for future use. |
ExtensionManifest
Manifest spec of an Extension needed for runtime execution.
Fields | |
---|---|
name |
Required. Extension name shown to the LLM. The name can be up to 128 characters long. |
description |
Required. The natural language description shown to the LLM. It should describe the usage of the extension, and is essential for the LLM to perform reasoning. e.g., if the extension is a data store, you can let the LLM know what data it contains. |
api_ |
Required. Immutable. The API specification shown to the LLM. |
auth_ |
Required. Immutable. Type of auth supported by this extension. |
ApiSpec
The API specification shown to the LLM.
Fields | |
---|---|
Union field
|
|
open_ |
The API spec in Open API standard and YAML format. |
open_ |
Cloud Storage URI pointing to the OpenAPI spec. |
ExtensionOperation
Operation of an extension.
Fields | |
---|---|
operation_ |
Operation ID that uniquely identifies the operations among the extension. See: "Operation Object" in https://swagger.io/specification/. This field is parsed from the OpenAPI spec. For HTTP extensions, if it does not exist in the spec, we will generate one from the HTTP method and path. |
function_ |
Output only. Structured representation of a function declaration as defined by the OpenAPI Spec. |
ExtensionPrivateServiceConnectConfig
PrivateExtensionConfig configuration for the extension.
Fields | |
---|---|
service_ |
Required. The Service Directory resource name in which the service endpoints associated to the extension are registered. Format:
|
Fact
The fact used in grounding.
Fields | |
---|---|
query |
Query that is used to retrieve this fact. |
title |
If present, it refers to the title of this fact. |
uri |
If present, this uri links to the source of the fact. |
summary |
If present, the summary/snippet of the fact. |
vector_distance |
If present, the distance between the query vector and this fact vector. |
score |
If present, according to the underlying Vector DB and the selected metric type, the score can be either the distance or the similarity between the query and the fact and its range depends on the metric type. For example, if the metric type is COSINE_DISTANCE, it represents the distance between the query and the fact. The larger the distance, the less relevant the fact is to the query. The range is [0, 2], while 0 means the most relevant and 2 means the least relevant. |
FetchPredictOperationRequest
Request message for PredictionService.FetchPredictOperation
.
Fields | |
---|---|
endpoint |
Required. The name of the Endpoint requested to serve the prediction. Format: |
operation_ |
Required. The server-assigned name for the operation. |
FileData
URI based data.
Fields | |
---|---|
mime_ |
Required. The IANA standard MIME type of the source data. |
file_ |
Required. URI. |
FileStatus
RagFile status.
Fields | |
---|---|
state |
Output only. RagFile state. |
error_ |
Output only. Only when the |
State
RagFile state.
Enums | |
---|---|
STATE_UNSPECIFIED |
RagFile state is unspecified. |
ACTIVE |
RagFile resource has been created and indexed successfully. |
ERROR |
RagFile resource is in a problematic state. See error_message field for details. |
FluencyInput
Input for fluency metric.
Fields | |
---|---|
metric_ |
Required. Spec for fluency score metric. |
instance |
Required. Fluency instance. |
FluencyInstance
Spec for fluency instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
FluencyResult
Spec for fluency result.
Fields | |
---|---|
explanation |
Output only. Explanation for fluency score. |
score |
Output only. Fluency score. |
confidence |
Output only. Confidence for fluency score. |
FluencySpec
Spec for fluency score metric.
Fields | |
---|---|
version |
Optional. Which version to use for evaluation. |
FulfillmentInput
Input for fulfillment metric.
Fields | |
---|---|
metric_ |
Required. Spec for fulfillment score metric. |
instance |
Required. Fulfillment instance. |
FulfillmentInstance
Spec for fulfillment instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
instruction |
Required. Inference instruction prompt to compare prediction with. |
FulfillmentResult
Spec for fulfillment result.
Fields | |
---|---|
explanation |
Output only. Explanation for fulfillment score. |
score |
Output only. Fulfillment score. |
confidence |
Output only. Confidence for fulfillment score. |
FulfillmentSpec
Spec for fulfillment metric.
Fields | |
---|---|
version |
Optional. Which version to use for evaluation. |
FunctionCall
A predicted [FunctionCall] returned from the model that contains a string representing the [FunctionDeclaration.name] and a structured JSON object containing the parameters and their values.
Fields | |
---|---|
name |
Required. The name of the function to call. Matches [FunctionDeclaration.name]. |
args |
Optional. Required. The function parameters and values in JSON object format. See [FunctionDeclaration.parameters] for parameter details. |
FunctionCallingConfig
Function calling config.
Fields | |
---|---|
mode |
Optional. Function calling mode. |
allowed_ |
Optional. Function names to call. Only set when the Mode is ANY. Function names should match [FunctionDeclaration.name]. With mode set to ANY, model will predict a function call from the set of function names provided. |
Mode
Function calling mode.
Enums | |
---|---|
MODE_UNSPECIFIED |
Unspecified function calling mode. This value should not be used. |
AUTO |
Default model behavior, model decides to predict either function calls or natural language response. |
ANY |
Model is constrained to always predicting function calls only. If "allowed_function_names" are set, the predicted function calls will be limited to any one of "allowed_function_names", else the predicted function calls will be any one of the provided "function_declarations". |
NONE |
Model will not predict any function calls. Model behavior is same as when not passing any function declarations. |
FunctionDeclaration
Structured representation of a function declaration as defined by the OpenAPI 3.0 specification. Included in this declaration are the function name, description, parameters and response type. This FunctionDeclaration is a representation of a block of code that can be used as a Tool
by the model and executed by the client.
Fields | |
---|---|
name |
Required. The name of the function to call. Must start with a letter or an underscore. Must be a-z, A-Z, 0-9, or contain underscores, dots and dashes, with a maximum length of 64. |
description |
Optional. Description and purpose of the function. Model uses it to decide how and whether to call the function. |
parameters |
Optional. Describes the parameters to this function in JSON Schema Object format. Reflects the Open API 3.03 Parameter Object. string Key: the name of the parameter. Parameter names are case sensitive. Schema Value: the Schema defining the type used for the parameter. For function with no parameters, this can be left unset. Parameter names must start with a letter or an underscore and must only contain chars a-z, A-Z, 0-9, or underscores with a maximum length of 64. Example with 1 required and 1 optional parameter: type: OBJECT properties: param1: type: STRING param2: type: INTEGER required: - param1 |
response |
Optional. Describes the output from this function in JSON Schema format. Reflects the Open API 3.03 Response Object. The Schema defines the type used for the response value of the function. |
FunctionResponse
The result output from a [FunctionCall] that contains a string representing the [FunctionDeclaration.name] and a structured JSON object containing any output from the function is used as context to the model. This should contain the result of a [FunctionCall] made based on model prediction.
Fields | |
---|---|
name |
Required. The name of the function to call. Matches [FunctionDeclaration.name] and [FunctionCall.name]. |
response |
Required. The function response in JSON object format. Use "output" key to specify function output and "error" key to specify error details (if any). If "output" and "error" keys are not specified, then whole "response" is treated as function output. |
GcsDestination
The Google Cloud Storage location where the output is to be written to.
Fields | |
---|---|
output_ |
Required. Google Cloud Storage URI to output directory. If the uri doesn't end with '/', a '/' will be automatically appended. The directory is created if it doesn't exist. |
GcsSource
The Google Cloud Storage location for the input content.
Fields | |
---|---|
uris[] |
Required. Google Cloud Storage URI(-s) to the input file(s). May contain wildcards. For more information on wildcards, see https://cloud.google.com/storage/docs/gsutil/addlhelp/WildcardNames. |
GenerateContentRequest
Request message for [PredictionService.GenerateContent].
Fields | |
---|---|
model |
Required. The fully qualified name of the publisher model or tuned model endpoint to use. Publisher model format: Tuned model endpoint format: |
contents[] |
Required. The content of the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request. |
cached_ |
Optional. The name of the cached content used as context to serve the prediction. Note: only used in explicit caching, where users can have control over caching (e.g. what content to cache) and enjoy guaranteed cost savings. Format: |
tools[] |
Optional. A list of A |
tool_ |
Optional. Tool config. This config is shared for all tools provided in the request. |
labels |
Optional. The labels with user-defined metadata for the request. It is used for billing and reporting only. Label keys and values can be no longer than 63 characters (Unicode codepoints) and can only contain lowercase letters, numeric characters, underscores, and dashes. International characters are allowed. Label values are optional. Label keys must start with a letter. |
safety_ |
Optional. Per request settings for blocking unsafe content. Enforced on GenerateContentResponse.candidates. |
generation_ |
Optional. Generation config. |
system_ |
Optional. The user provided system instructions for the model. Note: only text should be used in parts and content in each part will be in a separate paragraph. |
GenerateContentResponse
Response message for [PredictionService.GenerateContent].
Fields | |
---|---|
candidates[] |
Output only. Generated candidates. |
model_ |
Output only. The model version used to generate the response. |
prompt_ |
Output only. Content filter results for a prompt sent in the request. Note: Sent only in the first stream chunk. Only happens when no candidates were generated due to content violations. |
usage_ |
Usage metadata about the response(s). |
PromptFeedback
Content filter results for a prompt sent in the request.
Fields | |
---|---|
block_ |
Output only. Blocked reason. |
safety_ |
Output only. Safety ratings. |
block_ |
Output only. A readable block reason message. |
BlockedReason
Blocked reason enumeration.
Enums | |
---|---|
BLOCKED_REASON_UNSPECIFIED |
Unspecified blocked reason. |
SAFETY |
Candidates blocked due to safety. |
OTHER |
Candidates blocked due to other reason. |
BLOCKLIST |
Candidates blocked due to the terms which are included from the terminology blocklist. |
PROHIBITED_CONTENT |
Candidates blocked due to prohibited content. |
UsageMetadata
Usage metadata about response(s).
Fields | |
---|---|
prompt_ |
Number of tokens in the request. When |
candidates_ |
Number of tokens in the response(s). |
total_ |
Total token count for prompt and response candidates. |
cached_ |
Output only. Number of tokens in the cached part in the input (the cached content). |
GenerationConfig
Generation config.
Fields | |
---|---|
stop_ |
Optional. Stop sequences. |
response_ |
Optional. Output response mimetype of the generated candidate text. Supported mimetype: - |
response_ |
Optional. The modalities of the response. |
temperature |
Optional. Controls the randomness of predictions. |
top_ |
Optional. If specified, nucleus sampling will be used. |
top_ |
Optional. If specified, top-k sampling will be used. |
candidate_ |
Optional. Number of candidates to generate. |
max_ |
Optional. The maximum number of output tokens to generate per message. |
response_ |
Optional. If true, export the logprobs results in response. |
logprobs |
Optional. Logit probabilities. |
presence_ |
Optional. Positive penalties. |
frequency_ |
Optional. Frequency penalties. |
seed |
Optional. Seed. |
response_ |
Optional. The |
routing_ |
Optional. Routing configuration. |
audio_ |
Optional. If enabled, audio timestamp will be included in the request to the model. |
media_ |
Optional. If specified, the media resolution specified will be used. |
speech_ |
Optional. The speech generation config. |
MediaResolution
Media resolution for the input media.
Enums | |
---|---|
MEDIA_RESOLUTION_UNSPECIFIED |
Media resolution has not been set. |
MEDIA_RESOLUTION_LOW |
Media resolution set to low (64 tokens). |
MEDIA_RESOLUTION_MEDIUM |
Media resolution set to medium (256 tokens). |
MEDIA_RESOLUTION_HIGH |
Media resolution set to high (zoomed reframing with 256 tokens). |
Modality
The modalities of the response.
Enums | |
---|---|
MODALITY_UNSPECIFIED |
Unspecified modality. Will be processed as text. |
TEXT |
Text modality. |
IMAGE |
Image modality. |
AUDIO |
Audio modality. |
RoutingConfig
The configuration for routing the request to a specific model.
Fields | |
---|---|
Union field routing_config . Routing mode. routing_config can be only one of the following: |
|
auto_ |
Automated routing. |
manual_ |
Manual routing. |
AutoRoutingMode
When automated routing is specified, the routing will be determined by the pretrained routing model and customer provided model routing preference.
Fields | |
---|---|
model_ |
The model routing preference. |
ModelRoutingPreference
The model routing preference.
Enums | |
---|---|
UNKNOWN |
Unspecified model routing preference. |
PRIORITIZE_QUALITY |
Prefer higher quality over low cost. |
BALANCED |
Balanced model routing preference. |
PRIORITIZE_COST |
Prefer lower cost over higher quality. |
ManualRoutingMode
When manual routing is set, the specified model will be used directly.
Fields | |
---|---|
model_ |
The model name to use. Only the public LLM models are accepted. e.g. 'gemini-1.5-pro-001'. |
GenericOperationMetadata
Generic Metadata shared by all operations.
Fields | |
---|---|
partial_ |
Output only. Partial failures encountered. E.g. single files that couldn't be read. This field should never exceed 20 entries. Status details field will contain standard Google Cloud error details. |
create_ |
Output only. Time when the operation was created. |
update_ |
Output only. Time when the operation was updated for the last time. If the operation has finished (successfully or not), this is the finish time. |
GetCacheConfigRequest
Request message for getting a cache config.
Fields | |
---|---|
name |
Required. Name of the cache config. Format: - |
GetCachedContentRequest
Request message for GenAiCacheService.GetCachedContent
.
Fields | |
---|---|
name |
Required. The resource name referring to the cached content |
GetExtensionRequest
Request message for ExtensionRegistryService.GetExtension
.
Fields | |
---|---|
name |
Required. The name of the Extension resource. Format: |
GetRagCorpusRequest
Request message for VertexRagDataService.GetRagCorpus
Fields | |
---|---|
name |
Required. The name of the RagCorpus resource. Format: |
GetRagFileRequest
Request message for VertexRagDataService.GetRagFile
Fields | |
---|---|
name |
Required. The name of the RagFile resource. Format: |
GetReasoningEngineRequest
Request message for ReasoningEngineService.GetReasoningEngine
.
Fields | |
---|---|
name |
Required. The name of the ReasoningEngine resource. Format: |
GetTuningJobRequest
Request message for GenAiTuningService.GetTuningJob
.
Fields | |
---|---|
name |
Required. The name of the TuningJob resource. Format: |
GoogleDriveSource
The Google Drive location for the input content.
Fields | |
---|---|
resource_ |
Required. Google Drive resource IDs. |
ResourceId
The type and ID of the Google Drive resource.
Fields | |
---|---|
resource_ |
Required. The type of the Google Drive resource. |
resource_ |
Required. The ID of the Google Drive resource. |
ResourceType
The type of the Google Drive resource.
Enums | |
---|---|
RESOURCE_TYPE_UNSPECIFIED |
Unspecified resource type. |
RESOURCE_TYPE_FILE |
File resource type. |
RESOURCE_TYPE_FOLDER |
Folder resource type. |
GoogleSearchRetrieval
Tool to retrieve public web data for grounding, powered by Google.
Fields | |
---|---|
dynamic_ |
Specifies the dynamic retrieval configuration for the given source. |
GroundednessInput
Input for groundedness metric.
Fields | |
---|---|
metric_ |
Required. Spec for groundedness metric. |
instance |
Required. Groundedness instance. |
GroundednessInstance
Spec for groundedness instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
context |
Required. Background information provided in context used to compare against the prediction. |
GroundednessResult
Spec for groundedness result.
Fields | |
---|---|
explanation |
Output only. Explanation for groundedness score. |
score |
Output only. Groundedness score. |
confidence |
Output only. Confidence for groundedness score. |
GroundednessSpec
Spec for groundedness metric.
Fields | |
---|---|
version |
Optional. Which version to use for evaluation. |
GroundingChunk
Grounding chunk.
Fields | |
---|---|
Union field chunk_type . Chunk type. chunk_type can be only one of the following: |
|
web |
Grounding chunk from the web. |
retrieved_ |
Grounding chunk from context retrieved by the retrieval tools. |
RetrievedContext
Chunk from context retrieved by the retrieval tools.
Fields | |
---|---|
uri |
URI reference of the attribution. |
title |
Title of the attribution. |
text |
Text of the attribution. |
Web
Chunk from the web.
Fields | |
---|---|
uri |
URI reference of the chunk. |
title |
Title of the chunk. |
GroundingMetadata
Metadata returned to client when grounding is enabled.
Fields | |
---|---|
web_ |
Optional. Web search queries for the following-up web search. |
retrieval_ |
Optional. Queries executed by the retrieval tools. |
grounding_ |
List of supporting references retrieved from specified grounding source. |
grounding_ |
Optional. List of grounding support. |
search_ |
Optional. Google search entry for the following-up web searches. |
retrieval_ |
Optional. Output only. Retrieval metadata. |
GroundingSupport
Grounding support.
Fields | |
---|---|
grounding_ |
A list of indices (into 'grounding_chunk') specifying the citations associated with the claim. For instance [1,3,4] means that grounding_chunk[1], grounding_chunk[3], grounding_chunk[4] are the retrieved content attributed to the claim. |
confidence_ |
Confidence score of the support references. Ranges from 0 to 1. 1 is the most confident. This list must have the same size as the grounding_chunk_indices. |
segment |
Segment of the content this support belongs to. |
HarmCategory
Harm categories that will block the content.
Enums | |
---|---|
HARM_CATEGORY_UNSPECIFIED |
The harm category is unspecified. |
HARM_CATEGORY_HATE_SPEECH |
The harm category is hate speech. |
HARM_CATEGORY_DANGEROUS_CONTENT |
The harm category is dangerous content. |
HARM_CATEGORY_HARASSMENT |
The harm category is harassment. |
HARM_CATEGORY_SEXUALLY_EXPLICIT |
The harm category is sexually explicit content. |
HARM_CATEGORY_CIVIC_INTEGRITY |
The harm category is civic integrity. |
HttpElementLocation
Enum of location an HTTP element can be.
Enums | |
---|---|
HTTP_IN_UNSPECIFIED |
|
HTTP_IN_QUERY |
Element is in the HTTP request query. |
HTTP_IN_HEADER |
Element is in the HTTP request header. |
HTTP_IN_PATH |
Element is in the HTTP request path. |
HTTP_IN_BODY |
Element is in the HTTP request body. |
HTTP_IN_COOKIE |
Element is in the HTTP request cookie. |
ImportExtensionOperationMetadata
Details of ExtensionRegistryService.ImportExtension
operation.
Fields | |
---|---|
generic_ |
The common part of the operation metadata. |
ImportExtensionRequest
Request message for ExtensionRegistryService.ImportExtension
.
Fields | |
---|---|
parent |
Required. The resource name of the Location to import the Extension in. Format: |
extension |
Required. The Extension to import. |
ImportRagFilesConfig
Config for importing RagFiles.
Fields | |
---|---|
rag_file_chunking_config |
Specifies the size and overlap of chunks after importing RagFiles. |
rag_ |
Specifies the transformation config for RagFiles. |
rag_ |
Optional. Specifies the parsing config for RagFiles. RAG will use the default parser if this field is not set. |
max_ |
Optional. The max number of queries per minute that this job is allowed to make to the embedding model specified on the corpus. This value is specific to this job and not shared across other import jobs. Consult the Quotas page on the project to set an appropriate value here. If unspecified, a default value of 1,000 QPM would be used. |
Union field import_source . The source of the import. import_source can be only one of the following: |
|
gcs_ |
Google Cloud Storage location. Supports importing individual files as well as entire Google Cloud Storage directories. Sample formats: - |
google_ |
Google Drive location. Supports importing individual files as well as Google Drive folders. |
slack_ |
Slack channels with their corresponding access tokens. |
jira_ |
Jira queries with their corresponding authentication. |
share_ |
SharePoint sources. |
Union field partial_failure_sink . Optional. If provided, all partial failures are written to the sink. Deprecated. Prefer to use the import_result_sink . partial_failure_sink can be only one of the following: |
|
partial_failure_gcs_sink |
The Cloud Storage path to write partial failures to. Deprecated. Prefer to use |
partial_failure_bigquery_sink |
The BigQuery destination to write partial failures to. It should be a bigquery table resource name (e.g. "bq://projectId.bqDatasetId.bqTableId"). The dataset must exist. If the table does not exist, it will be created with the expected schema. If the table exists, the schema will be validated and data will be added to this existing table. Deprecated. Prefer to use |
ImportRagFilesOperationMetadata
Runtime operation information for VertexRagDataService.ImportRagFiles
.
Fields | |
---|---|
generic_ |
The operation generic information. |
rag_ |
The resource ID of RagCorpus that this operation is executed on. |
import_ |
Output only. The config that was passed in the ImportRagFilesRequest. |
progress_ |
The progress percentage of the operation. Value is in the range [0, 100]. This percentage is calculated as follows: progress_percentage = 100 * (successes + failures + skips) / total |
ImportRagFilesRequest
Request message for VertexRagDataService.ImportRagFiles
.
Fields | |
---|---|
parent |
Required. The name of the RagCorpus resource into which to import files. Format: |
import_ |
Required. The config for the RagFiles to be synced and imported into the RagCorpus. |
ImportRagFilesResponse
Response message for VertexRagDataService.ImportRagFiles
.
Fields | |
---|---|
imported_ |
The number of RagFiles that had been imported into the RagCorpus. |
failed_ |
The number of RagFiles that had failed while importing into the RagCorpus. |
skipped_ |
The number of RagFiles that was skipped while importing into the RagCorpus. |
Union field partial_failure_sink . The location into which the partial failures were written. partial_failure_sink can be only one of the following: |
|
partial_ |
The Google Cloud Storage path into which the partial failures were written. |
partial_ |
The BigQuery table into which the partial failures were written. |
JiraSource
The Jira source for the ImportRagFilesRequest.
Fields | |
---|---|
jira_ |
Required. The Jira queries. |
JiraQueries
JiraQueries contains the Jira queries and corresponding authentication.
Fields | |
---|---|
projects[] |
A list of Jira projects to import in their entirety. |
custom_ |
A list of custom Jira queries to import. For information about JQL (Jira Query Language), see https://support.atlassian.com/jira-service-management-cloud/docs/use-advanced-search-with-jira-query-language-jql/ |
email |
Required. The Jira email address. |
server_ |
Required. The Jira server URI. |
api_ |
Required. The SecretManager secret version resource name (e.g. projects/{project}/secrets/{secret}/versions/{version}) storing the Jira API key. See Manage API tokens for your Atlassian account. |
JobState
Describes the state of a job.
Enums | |
---|---|
JOB_STATE_UNSPECIFIED |
The job state is unspecified. |
JOB_STATE_QUEUED |
The job has been just created or resumed and processing has not yet begun. |
JOB_STATE_PENDING |
The service is preparing to run the job. |
JOB_STATE_RUNNING |
The job is in progress. |
JOB_STATE_SUCCEEDED |
The job completed successfully. |
JOB_STATE_FAILED |
The job failed. |
JOB_STATE_CANCELLING |
The job is being cancelled. From this state the job may only go to either JOB_STATE_SUCCEEDED , JOB_STATE_FAILED or JOB_STATE_CANCELLED . |
JOB_STATE_CANCELLED |
The job has been cancelled. |
JOB_STATE_PAUSED |
The job has been stopped, and can be resumed. |
JOB_STATE_EXPIRED |
The job has expired. |
JOB_STATE_UPDATING |
The job is being updated. Only jobs in the RUNNING state can be updated. After updating, the job goes back to the RUNNING state. |
JOB_STATE_PARTIALLY_SUCCEEDED |
The job is partially succeeded, some results may be missing due to errors. |
ListCachedContentsRequest
Request to list CachedContents.
Fields | |
---|---|
parent |
Required. The parent, which owns this collection of cached contents. |
page_ |
Optional. The maximum number of cached contents to return. The service may return fewer than this value. If unspecified, some default (under maximum) number of items will be returned. The maximum value is 1000; values above 1000 will be coerced to 1000. |
page_ |
Optional. A page token, received from a previous When paginating, all other parameters provided to |
ListCachedContentsResponse
Response with a list of CachedContents.
Fields | |
---|---|
cached_ |
List of cached contents. |
next_ |
A token, which can be sent as |
ListExtensionsRequest
Request message for ExtensionRegistryService.ListExtensions
.
Fields | |
---|---|
parent |
Required. The resource name of the Location to list the Extensions from. Format: |
filter |
Optional. The standard list filter. Supported fields: * More detail in AIP-160. |
page_ |
Optional. The standard list page size. |
page_ |
Optional. The standard list page token. |
order_ |
Optional. A comma-separated list of fields to order by, sorted in ascending order. Use "desc" after a field name for descending. Supported fields: * Example: |
ListExtensionsResponse
Response message for ExtensionRegistryService.ListExtensions
Fields | |
---|---|
extensions[] |
List of Extension in the requested page. |
next_ |
A token to retrieve the next page of results. Pass to |
ListRagCorporaRequest
Request message for VertexRagDataService.ListRagCorpora
.
Fields | |
---|---|
parent |
Required. The resource name of the Location from which to list the RagCorpora. Format: |
page_ |
Optional. The standard list page size. |
page_ |
Optional. The standard list page token. Typically obtained via |
ListRagCorporaResponse
Response message for VertexRagDataService.ListRagCorpora
.
Fields | |
---|---|
rag_ |
List of RagCorpora in the requested page. |
next_ |
A token to retrieve the next page of results. Pass to |
ListRagFilesRequest
Request message for VertexRagDataService.ListRagFiles
.
Fields | |
---|---|
parent |
Required. The resource name of the RagCorpus from which to list the RagFiles. Format: |
page_ |
Optional. The standard list page size. |
page_ |
Optional. The standard list page token. Typically obtained via |
ListRagFilesResponse
Response message for VertexRagDataService.ListRagFiles
.
Fields | |
---|---|
rag_ |
List of RagFiles in the requested page. |
next_ |
A token to retrieve the next page of results. Pass to |
ListReasoningEnginesRequest
Request message for ReasoningEngineService.ListReasoningEngines
.
Fields | |
---|---|
parent |
Required. The resource name of the Location to list the ReasoningEngines from. Format: |
filter |
Optional. The standard list filter. More detail in AIP-160. |
page_ |
Optional. The standard list page size. |
page_ |
Optional. The standard list page token. |
ListReasoningEnginesResponse
Response message for ReasoningEngineService.ListReasoningEngines
Fields | |
---|---|
reasoning_ |
List of ReasoningEngines in the requested page. |
next_ |
A token to retrieve the next page of results. Pass to |
ListTuningJobsRequest
Request message for GenAiTuningService.ListTuningJobs
.
Fields | |
---|---|
parent |
Required. The resource name of the Location to list the TuningJobs from. Format: |
filter |
Optional. The standard list filter. |
page_ |
Optional. The standard list page size. |
page_ |
Optional. The standard list page token. Typically obtained via |
ListTuningJobsResponse
Response message for GenAiTuningService.ListTuningJobs
Fields | |
---|---|
tuning_ |
List of TuningJobs in the requested page. |
next_ |
A token to retrieve the next page of results. Pass to |
LogprobsResult
Logprobs Result
Fields | |
---|---|
top_ |
Length = total number of decoding steps. |
chosen_ |
Length = total number of decoding steps. The chosen candidates may or may not be in top_candidates. |
Candidate
Candidate for the logprobs token and score.
Fields | |
---|---|
token |
The candidate's token string value. |
token_ |
The candidate's token id value. |
log_ |
The candidate's log probability. |
TopCandidates
Candidates with top log probabilities at each decoding step.
Fields | |
---|---|
candidates[] |
Sorted by log probability in descending order. |
MetricxInput
Input for MetricX metric.
Fields | |
---|---|
metric_ |
Required. Spec for Metricx metric. |
instance |
Required. Metricx instance. |
MetricxInstance
Spec for MetricX instance - The fields used for evaluation are dependent on the MetricX version.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
source |
Optional. Source text in original language. |
MetricxResult
Spec for MetricX result - calculates the MetricX score for the given instance using the version specified in the spec.
Fields | |
---|---|
score |
Output only. MetricX score. Range depends on version. |
MetricxSpec
Spec for MetricX metric.
Fields | |
---|---|
source_ |
Optional. Source language in BCP-47 format. |
target_ |
Optional. Target language in BCP-47 format. Covers both prediction and reference. |
version |
Required. Which version to use for evaluation. |
MetricxVersion
MetricX Version options.
Enums | |
---|---|
METRICX_VERSION_UNSPECIFIED |
MetricX version unspecified. |
METRICX_24_REF |
MetricX 2024 (2.6) for translation + reference (reference-based). |
METRICX_24_SRC |
MetricX 2024 (2.6) for translation + source (QE). |
METRICX_24_SRC_REF |
MetricX 2024 (2.6) for translation + source + reference (source-reference-combined). |
PairwiseChoice
Pairwise prediction autorater preference.
Enums | |
---|---|
PAIRWISE_CHOICE_UNSPECIFIED |
Unspecified prediction choice. |
BASELINE |
Baseline prediction wins |
CANDIDATE |
Candidate prediction wins |
TIE |
Winner cannot be determined |
PairwiseMetricInput
Input for pairwise metric.
Fields | |
---|---|
metric_ |
Required. Spec for pairwise metric. |
instance |
Required. Pairwise metric instance. |
PairwiseMetricInstance
Pairwise metric instance. Usually one instance corresponds to one row in an evaluation dataset.
Fields | |
---|---|
Union field instance . Instance for pairwise metric. instance can be only one of the following: |
|
json_ |
Instance specified as a json string. String key-value pairs are expected in the json_instance to render PairwiseMetricSpec.instance_prompt_template. |
PairwiseMetricResult
Spec for pairwise metric result.
Fields | |
---|---|
pairwise_ |
Output only. Pairwise metric choice. |
explanation |
Output only. Explanation for pairwise metric score. |
PairwiseMetricSpec
Spec for pairwise metric.
Fields | |
---|---|
metric_ |
Required. Metric prompt template for pairwise metric. |
PairwiseQuestionAnsweringQualityInput
Input for pairwise question answering quality metric.
Fields | |
---|---|
metric_ |
Required. Spec for pairwise question answering quality score metric. |
instance |
Required. Pairwise question answering quality instance. |
PairwiseQuestionAnsweringQualityInstance
Spec for pairwise question answering quality instance.
Fields | |
---|---|
prediction |
Required. Output of the candidate model. |
baseline_ |
Required. Output of the baseline model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Required. Text to answer the question. |
instruction |
Required. Question Answering prompt for LLM. |
PairwiseQuestionAnsweringQualityResult
Spec for pairwise question answering quality result.
Fields | |
---|---|
pairwise_ |
Output only. Pairwise question answering prediction choice. |
explanation |
Output only. Explanation for question answering quality score. |
confidence |
Output only. Confidence for question answering quality score. |
PairwiseQuestionAnsweringQualitySpec
Spec for pairwise question answering quality score metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute question answering quality. |
version |
Optional. Which version to use for evaluation. |
PairwiseSummarizationQualityInput
Input for pairwise summarization quality metric.
Fields | |
---|---|
metric_ |
Required. Spec for pairwise summarization quality score metric. |
instance |
Required. Pairwise summarization quality instance. |
PairwiseSummarizationQualityInstance
Spec for pairwise summarization quality instance.
Fields | |
---|---|
prediction |
Required. Output of the candidate model. |
baseline_ |
Required. Output of the baseline model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Required. Text to be summarized. |
instruction |
Required. Summarization prompt for LLM. |
PairwiseSummarizationQualityResult
Spec for pairwise summarization quality result.
Fields | |
---|---|
pairwise_ |
Output only. Pairwise summarization prediction choice. |
explanation |
Output only. Explanation for summarization quality score. |
confidence |
Output only. Confidence for summarization quality score. |
PairwiseSummarizationQualitySpec
Spec for pairwise summarization quality score metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute pairwise summarization quality. |
version |
Optional. Which version to use for evaluation. |
Part
A datatype containing media that is part of a multi-part Content
message.
A Part
consists of data which has an associated datatype. A Part
can only contain one of the accepted types in Part.data
.
A Part
must have a fixed IANA MIME type identifying the type and subtype of the media if inline_data
or file_data
field is filled with raw bytes.
Fields | |
---|---|
Union field
|
|
text |
Optional. Text part (can be code). |
inline_ |
Optional. Inlined bytes data. |
file_ |
Optional. URI based data. |
function_ |
Optional. A predicted [FunctionCall] returned from the model that contains a string representing the [FunctionDeclaration.name] with the parameters and their values. |
function_ |
Optional. The result output of a [FunctionCall] that contains a string representing the [FunctionDeclaration.name] and a structured JSON object containing any output from the function call. It is used as context to the model. |
executable_ |
Optional. Code generated by the model that is meant to be executed. |
code_ |
Optional. Result of executing the [ExecutableCode]. |
Union field
|
|
video_ |
Optional. Video metadata. The metadata should only be specified while the video data is presented in inline_data or file_data. |
PartnerModelTuningSpec
Tuning spec for Partner models.
Fields | |
---|---|
training_ |
Required. Cloud Storage path to file containing training dataset for tuning. The dataset must be formatted as a JSONL file. |
validation_ |
Optional. Cloud Storage path to file containing validation dataset for tuning. The dataset must be formatted as a JSONL file. |
hyper_ |
Hyperparameters for tuning. The accepted hyper_parameters and their valid range of values will differ depending on the base model. |
PointwiseMetricInput
Input for pointwise metric.
Fields | |
---|---|
metric_ |
Required. Spec for pointwise metric. |
instance |
Required. Pointwise metric instance. |
PointwiseMetricInstance
Pointwise metric instance. Usually one instance corresponds to one row in an evaluation dataset.
Fields | |
---|---|
Union field instance . Instance for pointwise metric. instance can be only one of the following: |
|
json_ |
Instance specified as a json string. String key-value pairs are expected in the json_instance to render PointwiseMetricSpec.instance_prompt_template. |
PointwiseMetricResult
Spec for pointwise metric result.
Fields | |
---|---|
explanation |
Output only. Explanation for pointwise metric score. |
score |
Output only. Pointwise metric score. |
PointwiseMetricSpec
Spec for pointwise metric.
Fields | |
---|---|
metric_ |
Required. Metric prompt template for pointwise metric. |
PrebuiltVoiceConfig
The configuration for the prebuilt speaker to use.
Fields | |
---|---|
voice_ |
The name of the preset voice to use. |
PredictLongRunningRequest
Request message for PredictionService.PredictLongRunning
.
Fields | |
---|---|
endpoint |
Required. The name of the Endpoint requested to serve the prediction. Format: |
instances[] |
Required. The instances that are the input to the prediction call. A DeployedModel may have an upper limit on the number of instances it supports per request, and when it is exceeded the prediction call errors in case of AutoML Models, or, in case of customer created Models, the behaviour is as documented by that Model. The schema of any single instance may be specified via Endpoint's DeployedModels' |
parameters |
Optional. The parameters that govern the prediction. The schema of the parameters may be specified via Endpoint's DeployedModels' |
PredictRequest
Request message for PredictionService.Predict
.
Fields | |
---|---|
endpoint |
Required. The name of the Endpoint requested to serve the prediction. Format: |
instances[] |
Required. The instances that are the input to the prediction call. A DeployedModel may have an upper limit on the number of instances it supports per request, and when it is exceeded the prediction call errors in case of AutoML Models, or, in case of customer created Models, the behaviour is as documented by that Model. The schema of any single instance may be specified via Endpoint's DeployedModels' |
parameters |
The parameters that govern the prediction. The schema of the parameters may be specified via Endpoint's DeployedModels' |
PredictResponse
Response message for PredictionService.Predict
.
Fields | |
---|---|
predictions[] |
The predictions that are the output of the predictions call. The schema of any single prediction may be specified via Endpoint's DeployedModels' |
deployed_ |
ID of the Endpoint's DeployedModel that served this prediction. |
model |
Output only. The resource name of the Model which is deployed as the DeployedModel that this prediction hits. |
model_ |
Output only. The version ID of the Model which is deployed as the DeployedModel that this prediction hits. |
model_ |
Output only. The |
metadata |
Output only. Request-level metadata returned by the model. The metadata type will be dependent upon the model implementation. |
QueryExtensionRequest
Request message for ExtensionExecutionService.QueryExtension
.
Fields | |
---|---|
name |
Required. Name (identifier) of the extension; Format: |
contents[] |
Required. The content of the current conversation with the model. For single-turn queries, this is a single instance. For multi-turn queries, this is a repeated field that contains conversation history + latest request. |
QueryExtensionResponse
Response message for ExtensionExecutionService.QueryExtension
.
Fields | |
---|---|
steps[] |
Steps of extension or LLM interaction, can contain function call, function response, or text response. The last step contains the final response to the query. |
failure_ |
Failure message if any. |
QueryReasoningEngineRequest
Request message for [ReasoningEngineExecutionService.Query][].
Fields | |
---|---|
name |
Required. The name of the ReasoningEngine resource to use. Format: |
input |
Optional. Input content provided by users in JSON object format. Examples include text query, function calling parameters, media bytes, etc. |
class_ |
Optional. Class method to be used for the query. It is optional and defaults to "query" if unspecified. |
QueryReasoningEngineResponse
Response message for [ReasoningEngineExecutionService.Query][]
Fields | |
---|---|
output |
Response provided by users in JSON object format. |
QuestionAnsweringCorrectnessInput
Input for question answering correctness metric.
Fields | |
---|---|
metric_ |
Required. Spec for question answering correctness score metric. |
instance |
Required. Question answering correctness instance. |
QuestionAnsweringCorrectnessInstance
Spec for question answering correctness instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Optional. Text provided as context to answer the question. |
instruction |
Required. The question asked and other instruction in the inference prompt. |
QuestionAnsweringCorrectnessResult
Spec for question answering correctness result.
Fields | |
---|---|
explanation |
Output only. Explanation for question answering correctness score. |
score |
Output only. Question Answering Correctness score. |
confidence |
Output only. Confidence for question answering correctness score. |
QuestionAnsweringCorrectnessSpec
Spec for question answering correctness metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute question answering correctness. |
version |
Optional. Which version to use for evaluation. |
QuestionAnsweringHelpfulnessInput
Input for question answering helpfulness metric.
Fields | |
---|---|
metric_ |
Required. Spec for question answering helpfulness score metric. |
instance |
Required. Question answering helpfulness instance. |
QuestionAnsweringHelpfulnessInstance
Spec for question answering helpfulness instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Optional. Text provided as context to answer the question. |
instruction |
Required. The question asked and other instruction in the inference prompt. |
QuestionAnsweringHelpfulnessResult
Spec for question answering helpfulness result.
Fields | |
---|---|
explanation |
Output only. Explanation for question answering helpfulness score. |
score |
Output only. Question Answering Helpfulness score. |
confidence |
Output only. Confidence for question answering helpfulness score. |
QuestionAnsweringHelpfulnessSpec
Spec for question answering helpfulness metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute question answering helpfulness. |
version |
Optional. Which version to use for evaluation. |
QuestionAnsweringQualityInput
Input for question answering quality metric.
Fields | |
---|---|
metric_ |
Required. Spec for question answering quality score metric. |
instance |
Required. Question answering quality instance. |
QuestionAnsweringQualityInstance
Spec for question answering quality instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Required. Text to answer the question. |
instruction |
Required. Question Answering prompt for LLM. |
QuestionAnsweringQualityResult
Spec for question answering quality result.
Fields | |
---|---|
explanation |
Output only. Explanation for question answering quality score. |
score |
Output only. Question Answering Quality score. |
confidence |
Output only. Confidence for question answering quality score. |
QuestionAnsweringQualitySpec
Spec for question answering quality score metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute question answering quality. |
version |
Optional. Which version to use for evaluation. |
QuestionAnsweringRelevanceInput
Input for question answering relevance metric.
Fields | |
---|---|
metric_ |
Required. Spec for question answering relevance score metric. |
instance |
Required. Question answering relevance instance. |
QuestionAnsweringRelevanceInstance
Spec for question answering relevance instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Optional. Text provided as context to answer the question. |
instruction |
Required. The question asked and other instruction in the inference prompt. |
QuestionAnsweringRelevanceResult
Spec for question answering relevance result.
Fields | |
---|---|
explanation |
Output only. Explanation for question answering relevance score. |
score |
Output only. Question Answering Relevance score. |
confidence |
Output only. Confidence for question answering relevance score. |
QuestionAnsweringRelevanceSpec
Spec for question answering relevance metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute question answering relevance. |
version |
Optional. Which version to use for evaluation. |
RagContexts
Relevant contexts for one query.
Fields | |
---|---|
contexts[] |
All its contexts. |
Context
A context of the query.
Fields | |
---|---|
source_ |
If the file is imported from Cloud Storage or Google Drive, source_uri will be original file URI in Cloud Storage or Google Drive; if file is uploaded, source_uri will be file display name. |
source_ |
The file display name. |
text |
The text chunk. |
distance |
The distance between the query dense embedding vector and the context text vector. |
sparse_distance |
The distance between the query sparse embedding vector and the context text vector. |
score |
According to the underlying Vector DB and the selected metric type, the score can be either the distance or the similarity between the query and the context and its range depends on the metric type. For example, if the metric type is COSINE_DISTANCE, it represents the distance between the query and the context. The larger the distance, the less relevant the context is to the query. The range is [0, 2], while 0 means the most relevant and 2 means the least relevant. |
RagCorpus
A RagCorpus is a RagFile container and a project can have multiple RagCorpora.
Fields | |
---|---|
name |
Output only. The resource name of the RagCorpus. |
display_ |
Required. The display name of the RagCorpus. The name can be up to 128 characters long and can consist of any UTF-8 characters. |
description |
Optional. The description of the RagCorpus. |
rag_embedding_model_config |
Optional. Immutable. The embedding model config of the RagCorpus. |
rag_vector_db_config |
Optional. Immutable. The Vector DB config of the RagCorpus. |
create_ |
Output only. Timestamp when this RagCorpus was created. |
update_ |
Output only. Timestamp when this RagCorpus was last updated. |
corpus_ |
Output only. RagCorpus state. |
Union field backend_config . The backend config of the RagCorpus. It can be data store and/or retrieval engine. backend_config can be only one of the following: |
|
vector_ |
Optional. Immutable. The config for the Vector DBs. |
vertex_ |
Optional. Immutable. The config for the Vertex AI Search. |
RagEmbeddingModelConfig
Config for the embedding model to use for RAG.
Fields | |
---|---|
Union field model_config . The model config to use. model_config can be only one of the following: |
|
vertex_ |
The Vertex AI Prediction Endpoint that either refers to a publisher model or an endpoint that is hosting a 1P fine-tuned text embedding model. Endpoints hosting non-1P fine-tuned text embedding models are currently not supported. This is used for dense vector search. |
hybrid_ |
Configuration for hybrid search. |
HybridSearchConfig
Config for hybrid search.
Fields | |
---|---|
sparse_ |
Optional. The configuration for sparse embedding generation. This field is optional the default behavior depends on the vector database choice on the RagCorpus. |
dense_ |
Required. The Vertex AI Prediction Endpoint that hosts the embedding model for dense embedding generations. |
SparseEmbeddingConfig
Configuration for sparse emebdding generation.
Fields | |
---|---|
Union field model . The model to use for sparse embedding generation. model can be only one of the following: |
|
bm25 |
Use BM25 scoring algorithm. |
Bm25
Message for BM25 parameters.
Fields | |
---|---|
multilingual |
Optional. Use multilingual tokenizer if set to true. |
k1 |
Optional. The parameter to control term frequency saturation. It determines the scaling between the matching term frequency and final score. k1 is in the range of [1.2, 3]. The default value is 1.2. |
b |
Optional. The parameter to control document length normalization. It determines how much the document length affects the final score. b is in the range of [0, 1]. The default value is 0.75. |
VertexPredictionEndpoint
Config representing a model hosted on Vertex Prediction Endpoint.
Fields | |
---|---|
endpoint |
Required. The endpoint resource name. Format: |
model |
Output only. The resource name of the model that is deployed on the endpoint. Present only when the endpoint is not a publisher model. Pattern: |
model_ |
Output only. Version ID of the model that is deployed on the endpoint. Present only when the endpoint is not a publisher model. |
RagFile
A RagFile contains user data for chunking, embedding and indexing.
Fields | |
---|---|
name |
Output only. The resource name of the RagFile. |
display_ |
Required. The display name of the RagFile. The name can be up to 128 characters long and can consist of any UTF-8 characters. |
description |
Optional. The description of the RagFile. |
size_ |
Output only. The size of the RagFile in bytes. |
rag_ |
Output only. The type of the RagFile. |
create_ |
Output only. Timestamp when this RagFile was created. |
update_ |
Output only. Timestamp when this RagFile was last updated. |
file_ |
Output only. State of the RagFile. |
Union field rag_file_source . The origin location of the RagFile if it is imported from Google Cloud Storage or Google Drive. rag_file_source can be only one of the following: |
|
gcs_ |
Output only. Google Cloud Storage location of the RagFile. It does not support wildcards in the Cloud Storage uri for now. |
google_ |
Output only. Google Drive location. Supports importing individual files as well as Google Drive folders. |
direct_ |
Output only. The RagFile is encapsulated and uploaded in the UploadRagFile request. |
slack_ |
The RagFile is imported from a Slack channel. |
jira_ |
The RagFile is imported from a Jira query. |
share_ |
The RagFile is imported from a SharePoint source. |
RagFileType
The type of the RagFile.
Enums | |
---|---|
RAG_FILE_TYPE_UNSPECIFIED |
RagFile type is unspecified. |
RAG_FILE_TYPE_TXT |
RagFile type is TXT. |
RAG_FILE_TYPE_PDF |
RagFile type is PDF. |
RagFileChunkingConfig
Specifies the size and overlap of chunks for RagFiles.
Fields | |
---|---|
chunk_size |
The size of the chunks. |
chunk_overlap |
The overlap between chunks. |
Union field chunking_config . Specifies the chunking config for RagFiles. chunking_config can be only one of the following: |
|
fixed_ |
Specifies the fixed length chunking config. |
FixedLengthChunking
Specifies the fixed length chunking config.
Fields | |
---|---|
chunk_ |
The size of the chunks. |
chunk_ |
The overlap between chunks. |
RagFileParsingConfig
Specifies the parsing config for RagFiles.
Fields | |
---|---|
use_advanced_pdf_parsing |
Whether to use advanced PDF parsing. |
Union field parser . The parser to use for RagFiles. parser can be only one of the following: |
|
advanced_ |
The Advanced Parser to use for RagFiles. |
layout_ |
The Layout Parser to use for RagFiles. |
llm_ |
The LLM Parser to use for RagFiles. |
AdvancedParser
Specifies the advanced parsing for RagFiles.
Fields | |
---|---|
use_ |
Whether to use advanced PDF parsing. |
LayoutParser
Document AI Layout Parser config.
Fields | |
---|---|
processor_ |
The full resource name of a Document AI processor or processor version. The processor must have type |
max_ |
The maximum number of requests the job is allowed to make to the Document AI processor per minute. Consult https://cloud.google.com/document-ai/quotas and the Quota page for your project to set an appropriate value here. If unspecified, a default value of 120 QPM would be used. |
LlmParser
Specifies the advanced parsing for RagFiles.
Fields | |
---|---|
model_ |
The name of a LLM model used for parsing. Format: |
max_ |
The maximum number of requests the job is allowed to make to the LLM model per minute. Consult https://cloud.google.com/vertex-ai/generative-ai/docs/quotas and your document size to set an appropriate value here. If unspecified, a default value of 5000 QPM would be used. |
custom_ |
The prompt to use for parsing. If not specified, a default prompt will be used. |
RagFileTransformationConfig
Specifies the transformation config for RagFiles.
Fields | |
---|---|
rag_ |
Specifies the chunking config for RagFiles. |
RagQuery
A query to retrieve relevant contexts.
Fields | |
---|---|
similarity_top_k |
Optional. The number of contexts to retrieve. |
ranking |
Optional. Configurations for hybrid search results ranking. |
rag_ |
Optional. The retrieval config for the query. |
Union field query . The query to retrieve contexts. Currently only text query is supported. query can be only one of the following: |
|
text |
Optional. The query in text format to get relevant contexts. |
Ranking
Configurations for hybrid search results ranking.
Fields | |
---|---|
alpha |
Optional. Alpha value controls the weight between dense and sparse vector search results. The range is [0, 1], while 0 means sparse vector search only and 1 means dense vector search only. The default value is 0.5 which balances sparse and dense vector search equally. |
RagRetrievalConfig
Specifies the context retrieval config.
Fields | |
---|---|
top_ |
Optional. The number of contexts to retrieve. |
hybrid_ |
Optional. Config for Hybrid Search. |
filter |
Optional. Config for filters. |
ranking |
Optional. Config for ranking and reranking. |
Filter
Config for filters.
Fields | |
---|---|
metadata_ |
Optional. String for metadata filtering. |
Union field vector_db_threshold . Filter contexts retrieved from the vector DB based on either vector distance or vector similarity. vector_db_threshold can be only one of the following: |
|
vector_ |
Optional. Only returns contexts with vector distance smaller than the threshold. |
vector_ |
Optional. Only returns contexts with vector similarity larger than the threshold. |
HybridSearch
Config for Hybrid Search.
Fields | |
---|---|
alpha |
Optional. Alpha value controls the weight between dense and sparse vector search results. The range is [0, 1], while 0 means sparse vector search only and 1 means dense vector search only. The default value is 0.5 which balances sparse and dense vector search equally. |
Ranking
Config for ranking and reranking.
Fields | |
---|---|
Union field ranking_config . Config options for ranking. Currently only Rank Service is supported. ranking_config can be only one of the following: |
|
rank_ |
Optional. Config for Rank Service. |
llm_ |
Optional. Config for LlmRanker. |
LlmRanker
Config for LlmRanker.
Fields | |
---|---|
model_ |
Optional. The model name used for ranking. Format: |
RankService
Config for Rank Service.
Fields | |
---|---|
model_ |
Optional. The model name of the rank service. Format: |
RagVectorDbConfig
Config for the Vector DB to use for RAG.
Fields | |
---|---|
api_ |
Authentication config for the chosen Vector DB. |
rag_ |
Optional. Immutable. The embedding model config of the Vector DB. |
Union field vector_db . The config for the Vector DB. vector_db can be only one of the following: |
|
rag_ |
The config for the RAG-managed Vector DB. |
weaviate |
The config for the Weaviate. |
pinecone |
The config for the Pinecone. |
vertex_ |
The config for the Vertex Feature Store. |
vertex_ |
The config for the Vertex Vector Search. |
Pinecone
The config for the Pinecone.
Fields | |
---|---|
index_ |
Pinecone index name. This value cannot be changed after it's set. |
RagManagedDb
This type has no fields.
The config for the default RAG-managed Vector DB.
VertexFeatureStore
The config for the Vertex Feature Store.
Fields | |
---|---|
feature_ |
The resource name of the FeatureView. Format: |
VertexVectorSearch
The config for the Vertex Vector Search.
Fields | |
---|---|
index_ |
The resource name of the Index Endpoint. Format: |
index |
The resource name of the Index. Format: |
Weaviate
The config for the Weaviate.
Fields | |
---|---|
http_ |
Weaviate DB instance HTTP endpoint. e.g. 34.56.78.90:8080 Vertex RAG only supports HTTP connection to Weaviate. This value cannot be changed after it's set. |
collection_ |
The corresponding collection this corpus maps to. This value cannot be changed after it's set. |
ReasoningEngine
ReasoningEngine provides a customizable runtime for models to determine which actions to take and in which order.
Fields | |
---|---|
name |
Identifier. The resource name of the ReasoningEngine. |
display_ |
Required. The display name of the ReasoningEngine. |
description |
Optional. The description of the ReasoningEngine. |
spec |
Required. Configurations of the ReasoningEngine |
create_ |
Output only. Timestamp when this ReasoningEngine was created. |
update_ |
Output only. Timestamp when this ReasoningEngine was most recently updated. |
etag |
Optional. Used to perform consistent read-modify-write updates. If not set, a blind "overwrite" update happens. |
ReasoningEngineSpec
ReasoningEngine configurations
Fields | |
---|---|
package_ |
Required. User provided package spec of the ReasoningEngine. |
class_ |
Optional. Declarations for object class methods in OpenAPI specification format. |
PackageSpec
User provided package spec like pickled object and package requirements.
Fields | |
---|---|
pickle_ |
Optional. The Cloud Storage URI of the pickled python object. |
dependency_ |
Optional. The Cloud Storage URI of the dependency files in tar.gz format. |
requirements_ |
Optional. The Cloud Storage URI of the |
python_ |
Optional. The Python version. Currently support 3.8, 3.9, 3.10, 3.11. If not specified, default value is 3.10. |
RebaseTunedModelOperationMetadata
Runtime operation information for GenAiTuningService.RebaseTunedModel
.
Fields | |
---|---|
generic_ |
The common part of the operation generic information. |
RebaseTunedModelRequest
Request message for GenAiTuningService.RebaseTunedModel
.
Fields | |
---|---|
parent |
Required. The resource name of the Location into which to rebase the Model. Format: |
tuned_ |
Required. TunedModel reference to retrieve the legacy model information. |
tuning_ |
Optional. The TuningJob to be updated. Users can use this TuningJob field to overwrite tuning configs. |
artifact_ |
Optional. The Google Cloud Storage location to write the artifacts. |
deploy_ |
Optional. By default, bison to gemini migration will always create new model/endpoint, but for gemini-1.0 to gemini-1.5 migration, we default deploy to the same endpoint. See details in this Section. |
Retrieval
Defines a retrieval tool that model can call to access external knowledge.
Fields | |
---|---|
disable_attribution |
Optional. Deprecated. This option is no longer supported. |
Union field source . The source of the retrieval. source can be only one of the following: |
|
vertex_ |
Set to use data source powered by Vertex AI Search. |
vertex_ |
Set to use data source powered by Vertex RAG store. User data is uploaded via the VertexRagDataService. |
RetrievalMetadata
Metadata related to retrieval in the grounding flow.
Fields | |
---|---|
google_ |
Optional. Score indicating how likely information from Google Search could help answer the prompt. The score is in the range |
RetrieveContextsRequest
Request message for VertexRagService.RetrieveContexts
.
Fields | |
---|---|
parent |
Required. The resource name of the Location from which to retrieve RagContexts. The users must have permission to make a call in the project. Format: |
query |
Required. Single RAG retrieve query. |
Union field data_source . Data Source to retrieve contexts. data_source can be only one of the following: |
|
vertex_ |
The data source for Vertex RagStore. |
VertexRagStore
The data source for Vertex RagStore.
Fields | |
---|---|
rag_corpora[] |
Optional. Deprecated. Please use rag_resources to specify the data source. |
rag_ |
Optional. The representation of the rag source. It can be used to specify corpus only or ragfiles. Currently only support one corpus or multiple files from one corpus. In the future we may open up multiple corpora support. |
vector_distance_threshold |
Optional. Only return contexts with vector distance smaller than the threshold. |
RagResource
The definition of the Rag resource.
Fields | |
---|---|
rag_ |
Optional. RagCorpora resource name. Format: |
rag_ |
Optional. rag_file_id. The files should be in the same rag_corpus set in rag_corpus field. |
RetrieveContextsResponse
Response message for VertexRagService.RetrieveContexts
.
Fields | |
---|---|
contexts |
The contexts of the query. |
RougeInput
Input for rouge metric.
Fields | |
---|---|
metric_ |
Required. Spec for rouge score metric. |
instances[] |
Required. Repeated rouge instances. |
RougeInstance
Spec for rouge instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Required. Ground truth used to compare against the prediction. |
RougeMetricValue
Rouge metric value for an instance.
Fields | |
---|---|
score |
Output only. Rouge score. |
RougeResults
Results for rouge metric.
Fields | |
---|---|
rouge_ |
Output only. Rouge metric values. |
RougeSpec
Spec for rouge score metric - calculates the recall of n-grams in prediction as compared to reference - returns a score ranging between 0 and 1.
Fields | |
---|---|
rouge_ |
Optional. Supported rouge types are rougen[1-9], rougeL, and rougeLsum. |
use_ |
Optional. Whether to use stemmer to compute rouge score. |
split_ |
Optional. Whether to split summaries while using rougeLsum. |
RuntimeConfig
Runtime configuration to run the extension.
Fields | |
---|---|
default_ |
Optional. Default parameters that will be set for all the execution of this extension. If specified, the parameter values can be overridden by values in [[ExecuteExtensionRequest.operation_params]] at request time. The struct should be in a form of map with param name as the key and actual param value as the value. E.g. If this operation requires a param "name" to be set to "abc". you can set this to something like {"name": "abc"}. |
Union field GoogleFirstPartyExtensionConfig . Runtime configurations for Google first party extensions. GoogleFirstPartyExtensionConfig can be only one of the following: |
|
code_ |
Code execution runtime configurations for code interpreter extension. |
vertex_ |
Runtime configuration for Vertex AI Search extension. |
CodeInterpreterRuntimeConfig
Fields | |
---|---|
file_ |
Optional. The Cloud Storage bucket for file input of this Extension. If specified, support input from the Cloud Storage bucket. Vertex Extension Custom Code Service Agent should be granted file reader to this bucket. If not specified, the extension will only accept file contents from request body and reject Cloud Storage file inputs. |
file_ |
Optional. The Cloud Storage bucket for file output of this Extension. If specified, write all output files to the Cloud Storage bucket. Vertex Extension Custom Code Service Agent should be granted file writer to this bucket. If not specified, the file content will be output in response body. |
VertexAISearchRuntimeConfig
Fields | |
---|---|
serving_ |
Optional. Vertex AI Search serving config name. Format: |
engine_ |
Optional. Vertex AI Search engine ID. This is used to construct the search request. By setting this engine_id, API will construct the serving config using the default value to call search API for the user. The engine_id and serving_config_name cannot both be empty at the same time. |
SafetyInput
Input for safety metric.
Fields | |
---|---|
metric_ |
Required. Spec for safety metric. |
instance |
Required. Safety instance. |
SafetyInstance
Spec for safety instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
SafetyRating
Safety rating corresponding to the generated content.
Fields | |
---|---|
category |
Output only. Harm category. |
probability |
Output only. Harm probability levels in the content. |
probability_ |
Output only. Harm probability score. |
severity |
Output only. Harm severity levels in the content. |
severity_ |
Output only. Harm severity score. |
blocked |
Output only. Indicates whether the content was filtered out because of this rating. |
HarmProbability
Harm probability levels in the content.
Enums | |
---|---|
HARM_PROBABILITY_UNSPECIFIED |
Harm probability unspecified. |
NEGLIGIBLE |
Negligible level of harm. |
LOW |
Low level of harm. |
MEDIUM |
Medium level of harm. |
HIGH |
High level of harm. |
HarmSeverity
Harm severity levels.
Enums | |
---|---|
HARM_SEVERITY_UNSPECIFIED |
Harm severity unspecified. |
HARM_SEVERITY_NEGLIGIBLE |
Negligible level of harm severity. |
HARM_SEVERITY_LOW |
Low level of harm severity. |
HARM_SEVERITY_MEDIUM |
Medium level of harm severity. |
HARM_SEVERITY_HIGH |
High level of harm severity. |
SafetyResult
Spec for safety result.
Fields | |
---|---|
explanation |
Output only. Explanation for safety score. |
score |
Output only. Safety score. |
confidence |
Output only. Confidence for safety score. |
SafetySetting
Safety settings.
Fields | |
---|---|
category |
Required. Harm category. |
threshold |
Required. The harm block threshold. |
method |
Optional. Specify if the threshold is used for probability or severity score. If not specified, the threshold is used for probability score. |
HarmBlockMethod
Probability vs severity.
Enums | |
---|---|
HARM_BLOCK_METHOD_UNSPECIFIED |
The harm block method is unspecified. |
SEVERITY |
The harm block method uses both probability and severity scores. |
PROBABILITY |
The harm block method uses the probability score. |
HarmBlockThreshold
Probability based thresholds levels for blocking.
Enums | |
---|---|
HARM_BLOCK_THRESHOLD_UNSPECIFIED |
Unspecified harm block threshold. |
BLOCK_LOW_AND_ABOVE |
Block low threshold and above (i.e. block more). |
BLOCK_MEDIUM_AND_ABOVE |
Block medium threshold and above. |
BLOCK_ONLY_HIGH |
Block only high threshold (i.e. block less). |
BLOCK_NONE |
Block none. |
OFF |
Turn off the safety filter. |
SafetySpec
Spec for safety metric.
Fields | |
---|---|
version |
Optional. Which version to use for evaluation. |
Schema
Schema is used to define the format of input/output data. Represents a select subset of an OpenAPI 3.0 schema object. More fields may be added in the future as needed.
Fields | |
---|---|
type |
Optional. The type of the data. |
format |
Optional. The format of the data. Supported formats: for NUMBER type: "float", "double" for INTEGER type: "int32", "int64" for STRING type: "email", "byte", etc |
title |
Optional. The title of the Schema. |
description |
Optional. The description of the data. |
nullable |
Optional. Indicates if the value may be null. |
default |
Optional. Default value of the data. |
items |
Optional. SCHEMA FIELDS FOR TYPE ARRAY Schema of the elements of Type.ARRAY. |
min_ |
Optional. Minimum number of the elements for Type.ARRAY. |
max_ |
Optional. Maximum number of the elements for Type.ARRAY. |
enum[] |
Optional. Possible values of the element of primitive type with enum format. Examples: 1. We can define direction as : {type:STRING, format:enum, enum:["EAST", NORTH", "SOUTH", "WEST"]} 2. We can define apartment number as : {type:INTEGER, format:enum, enum:["101", "201", "301"]} |
properties |
Optional. SCHEMA FIELDS FOR TYPE OBJECT Properties of Type.OBJECT. |
property_ |
Optional. The order of the properties. Not a standard field in open api spec. Only used to support the order of the properties. |
required[] |
Optional. Required properties of Type.OBJECT. |
min_ |
Optional. Minimum number of the properties for Type.OBJECT. |
max_ |
Optional. Maximum number of the properties for Type.OBJECT. |
minimum |
Optional. SCHEMA FIELDS FOR TYPE INTEGER and NUMBER Minimum value of the Type.INTEGER and Type.NUMBER |
maximum |
Optional. Maximum value of the Type.INTEGER and Type.NUMBER |
min_ |
Optional. SCHEMA FIELDS FOR TYPE STRING Minimum length of the Type.STRING |
max_ |
Optional. Maximum length of the Type.STRING |
pattern |
Optional. Pattern of the Type.STRING to restrict a string to a regular expression. |
example |
Optional. Example of the object. Will only populated when the object is the root. |
any_ |
Optional. The value should be validated against any (one or more) of the subschemas in the list. |
SearchEntryPoint
Google search entry point.
Fields | |
---|---|
rendered_ |
Optional. Web content snippet that can be embedded in a web page or an app webview. |
sdk_ |
Optional. Base64 encoded JSON representing array of <search term, search url> tuple. |
Segment
Segment of the content.
Fields | |
---|---|
part_ |
Output only. The index of a Part object within its parent Content object. |
start_ |
Output only. Start index in the given Part, measured in bytes. Offset from the start of the Part, inclusive, starting at zero. |
end_ |
Output only. End index in the given Part, measured in bytes. Offset from the start of the Part, exclusive, starting at zero. |
text |
Output only. The text corresponding to the segment from the response. |
SlackSource
The Slack source for the ImportRagFilesRequest.
Fields | |
---|---|
channels[] |
Required. The Slack channels. |
SlackChannels
SlackChannels contains the Slack channels and corresponding access token.
Fields | |
---|---|
channels[] |
Required. The Slack channel IDs. |
api_ |
Required. The SecretManager secret version resource name (e.g. projects/{project}/secrets/{secret}/versions/{version}) storing the Slack channel access token that has access to the slack channel IDs. See: https://api.slack.com/tutorials/tracks/getting-a-token. |
SlackChannel
SlackChannel contains the Slack channel ID and the time range to import.
Fields | |
---|---|
channel_ |
Required. The Slack channel ID. |
start_ |
Optional. The starting timestamp for messages to import. |
end_ |
Optional. The ending timestamp for messages to import. |
SpeechConfig
The speech generation config.
Fields | |
---|---|
voice_ |
The configuration for the speaker to use. |
StreamDirectPredictRequest
Request message for PredictionService.StreamDirectPredict
.
The first message must contain endpoint
field and optionally [input][]. The subsequent messages must contain [input][].
Fields | |
---|---|
endpoint |
Required. The name of the Endpoint requested to serve the prediction. Format: |
inputs[] |
Optional. The prediction input. |
parameters |
Optional. The parameters that govern the prediction. |
StreamDirectPredictResponse
Response message for PredictionService.StreamDirectPredict
.
Fields | |
---|---|
outputs[] |
The prediction output. |
parameters |
The parameters that govern the prediction. |
StreamDirectRawPredictRequest
Request message for PredictionService.StreamDirectRawPredict
.
The first message must contain endpoint
and method_name
fields and optionally input
. The subsequent messages must contain input
. method_name
in the subsequent messages have no effect.
Fields | |
---|---|
endpoint |
Required. The name of the Endpoint requested to serve the prediction. Format: |
method_ |
Optional. Fully qualified name of the API method being invoked to perform predictions. Format: |
input |
Optional. The prediction input. |
StreamDirectRawPredictResponse
Response message for PredictionService.StreamDirectRawPredict
.
Fields | |
---|---|
output |
The prediction output. |
StreamQueryReasoningEngineRequest
Request message for [ReasoningEngineExecutionService.StreamQuery][].
Fields | |
---|---|
name |
Required. The name of the ReasoningEngine resource to use. Format: |
input |
Optional. Input content provided by users in JSON object format. Examples include text query, function calling parameters, media bytes, etc. |
class_ |
Optional. Class method to be used for the stream query. It is optional and defaults to "stream_query" if unspecified. |
StreamingPredictRequest
Request message for PredictionService.StreamingPredict
.
The first message must contain endpoint
field and optionally [input][]. The subsequent messages must contain [input][].
Fields | |
---|---|
endpoint |
Required. The name of the Endpoint requested to serve the prediction. Format: |
inputs[] |
The prediction input. |
parameters |
The parameters that govern the prediction. |
StreamingPredictResponse
Response message for PredictionService.StreamingPredict
.
Fields | |
---|---|
outputs[] |
The prediction output. |
parameters |
The parameters that govern the prediction. |
StreamingRawPredictRequest
Request message for PredictionService.StreamingRawPredict
.
The first message must contain endpoint
and method_name
fields and optionally input
. The subsequent messages must contain input
. method_name
in the subsequent messages have no effect.
Fields | |
---|---|
endpoint |
Required. The name of the Endpoint requested to serve the prediction. Format: |
method_ |
Fully qualified name of the API method being invoked to perform predictions. Format: |
input |
The prediction input. |
StreamingRawPredictResponse
Response message for PredictionService.StreamingRawPredict
.
Fields | |
---|---|
output |
The prediction output. |
SummarizationHelpfulnessInput
Input for summarization helpfulness metric.
Fields | |
---|---|
metric_ |
Required. Spec for summarization helpfulness score metric. |
instance |
Required. Summarization helpfulness instance. |
SummarizationHelpfulnessInstance
Spec for summarization helpfulness instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Required. Text to be summarized. |
instruction |
Optional. Summarization prompt for LLM. |
SummarizationHelpfulnessResult
Spec for summarization helpfulness result.
Fields | |
---|---|
explanation |
Output only. Explanation for summarization helpfulness score. |
score |
Output only. Summarization Helpfulness score. |
confidence |
Output only. Confidence for summarization helpfulness score. |
SummarizationHelpfulnessSpec
Spec for summarization helpfulness score metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute summarization helpfulness. |
version |
Optional. Which version to use for evaluation. |
SummarizationQualityInput
Input for summarization quality metric.
Fields | |
---|---|
metric_ |
Required. Spec for summarization quality score metric. |
instance |
Required. Summarization quality instance. |
SummarizationQualityInstance
Spec for summarization quality instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Required. Text to be summarized. |
instruction |
Required. Summarization prompt for LLM. |
SummarizationQualityResult
Spec for summarization quality result.
Fields | |
---|---|
explanation |
Output only. Explanation for summarization quality score. |
score |
Output only. Summarization Quality score. |
confidence |
Output only. Confidence for summarization quality score. |
SummarizationQualitySpec
Spec for summarization quality score metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute summarization quality. |
version |
Optional. Which version to use for evaluation. |
SummarizationVerbosityInput
Input for summarization verbosity metric.
Fields | |
---|---|
metric_ |
Required. Spec for summarization verbosity score metric. |
instance |
Required. Summarization verbosity instance. |
SummarizationVerbosityInstance
Spec for summarization verbosity instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Optional. Ground truth used to compare against the prediction. |
context |
Required. Text to be summarized. |
instruction |
Optional. Summarization prompt for LLM. |
SummarizationVerbosityResult
Spec for summarization verbosity result.
Fields | |
---|---|
explanation |
Output only. Explanation for summarization verbosity score. |
score |
Output only. Summarization Verbosity score. |
confidence |
Output only. Confidence for summarization verbosity score. |
SummarizationVerbositySpec
Spec for summarization verbosity score metric.
Fields | |
---|---|
use_ |
Optional. Whether to use instance.reference to compute summarization verbosity. |
version |
Optional. Which version to use for evaluation. |
SupervisedHyperParameters
Hyperparameters for SFT.
Fields | |
---|---|
epoch_ |
Optional. Number of complete passes the model makes over the entire training dataset during training. |
learning_ |
Optional. Multiplier for adjusting the default learning rate. |
adapter_ |
Optional. Adapter size for tuning. |
AdapterSize
Supported adapter sizes for tuning.
Enums | |
---|---|
ADAPTER_SIZE_UNSPECIFIED |
Adapter size is unspecified. |
ADAPTER_SIZE_ONE |
Adapter size 1. |
ADAPTER_SIZE_FOUR |
Adapter size 4. |
ADAPTER_SIZE_EIGHT |
Adapter size 8. |
ADAPTER_SIZE_SIXTEEN |
Adapter size 16. |
ADAPTER_SIZE_THIRTY_TWO |
Adapter size 32. |
SupervisedTuningDataStats
Tuning data statistics for Supervised Tuning.
Fields | |
---|---|
tuning_ |
Output only. Number of examples in the tuning dataset. |
total_ |
Output only. Number of tuning characters in the tuning dataset. |
total_billable_character_count |
Output only. Number of billable characters in the tuning dataset. |
total_ |
Output only. Number of billable tokens in the tuning dataset. |
tuning_ |
Output only. Number of tuning steps for this Tuning Job. |
user_ |
Output only. Dataset distributions for the user input tokens. |
user_ |
Output only. Dataset distributions for the user output tokens. |
user_ |
Output only. Dataset distributions for the messages per example. |
user_ |
Output only. Sample user messages in the training dataset uri. |
total_ |
The number of examples in the dataset that have been truncated by any amount. |
truncated_ |
A partial sample of the indices (starting from 1) of the truncated examples. |
SupervisedTuningDatasetDistribution
Dataset distribution for Supervised Tuning.
Fields | |
---|---|
sum |
Output only. Sum of a given population of values. |
billable_ |
Output only. Sum of a given population of values that are billable. |
min |
Output only. The minimum of the population values. |
max |
Output only. The maximum of the population values. |
mean |
Output only. The arithmetic mean of the values in the population. |
median |
Output only. The median of the values in the population. |
p5 |
Output only. The 5th percentile of the values in the population. |
p95 |
Output only. The 95th percentile of the values in the population. |
buckets[] |
Output only. Defines the histogram bucket. |
DatasetBucket
Dataset bucket used to create a histogram for the distribution given a population of values.
Fields | |
---|---|
count |
Output only. Number of values in the bucket. |
left |
Output only. Left bound of the bucket. |
right |
Output only. Right bound of the bucket. |
SupervisedTuningSpec
Tuning Spec for Supervised Tuning for first party models.
Fields | |
---|---|
training_ |
Required. Cloud Storage path to file containing training dataset for tuning. The dataset must be formatted as a JSONL file. |
validation_ |
Optional. Cloud Storage path to file containing validation dataset for tuning. The dataset must be formatted as a JSONL file. |
hyper_ |
Optional. Hyperparameters for SFT. |
Tensor
A tensor value type.
Fields | |
---|---|
dtype |
The data type of tensor. |
shape[] |
Shape of the tensor. |
bool_ |
Type specific representations that make it easy to create tensor protos in all languages. Only the representation corresponding to "dtype" can be set. The values hold the flattened representation of the tensor in row major order. |
string_ |
|
bytes_ |
|
float_ |
|
double_ |
|
int_ |
|
int64_ |
|
uint_ |
|
uint64_ |
|
list_ |
A list of tensor values. |
struct_ |
A map of string to tensor. |
tensor_ |
Serialized raw tensor content. |
DataType
Data type of the tensor.
Enums | |
---|---|
DATA_TYPE_UNSPECIFIED |
Not a legal value for DataType. Used to indicate a DataType field has not been set. |
BOOL |
Data types that all computation devices are expected to be capable to support. |
STRING |
|
FLOAT |
|
DOUBLE |
|
INT8 |
|
INT16 |
|
INT32 |
|
INT64 |
|
UINT8 |
|
UINT16 |
|
UINT32 |
|
UINT64 |
Tool
Tool details that the model may use to generate response.
A Tool
is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. A Tool object should contain exactly one type of Tool (e.g FunctionDeclaration, Retrieval or GoogleSearchRetrieval).
Fields | |
---|---|
function_ |
Optional. Function tool type. One or more function declarations to be passed to the model along with the current user query. Model may decide to call a subset of these functions by populating |
retrieval |
Optional. Retrieval tool type. System will always execute the provided retrieval tool(s) to get external knowledge to answer the prompt. Retrieval results are presented to the model for generation. |
google_ |
Optional. GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google. |
google_ |
Optional. GoogleSearchRetrieval tool type. Specialized retrieval tool that is powered by Google search. |
code_ |
Optional. CodeExecution tool type. Enables the model to execute code as part of generation. This field is only used by the Gemini Developer API services. |
CodeExecution
This type has no fields.
Tool that executes code generated by the model, and automatically returns the result to the model.
See also [ExecutableCode]and [CodeExecutionResult] which are input and output to this tool.
GoogleSearch
This type has no fields.
GoogleSearch tool type. Tool to support Google Search in Model. Powered by Google.
ToolCall
Spec for tool call.
Fields | |
---|---|
tool_ |
Required. Spec for tool name |
tool_ |
Optional. Spec for tool input |
ToolCallValidInput
Input for tool call valid metric.
Fields | |
---|---|
metric_ |
Required. Spec for tool call valid metric. |
instances[] |
Required. Repeated tool call valid instances. |
ToolCallValidInstance
Spec for tool call valid instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Required. Ground truth used to compare against the prediction. |
ToolCallValidMetricValue
Tool call valid metric value for an instance.
Fields | |
---|---|
score |
Output only. Tool call valid score. |
ToolCallValidResults
Results for tool call valid metric.
Fields | |
---|---|
tool_ |
Output only. Tool call valid metric values. |
ToolCallValidSpec
This type has no fields.
Spec for tool call valid metric.
ToolConfig
Tool config. This config is shared for all tools provided in the request.
Fields | |
---|---|
function_ |
Optional. Function calling config. |
ToolNameMatchInput
Input for tool name match metric.
Fields | |
---|---|
metric_ |
Required. Spec for tool name match metric. |
instances[] |
Required. Repeated tool name match instances. |
ToolNameMatchInstance
Spec for tool name match instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Required. Ground truth used to compare against the prediction. |
ToolNameMatchMetricValue
Tool name match metric value for an instance.
Fields | |
---|---|
score |
Output only. Tool name match score. |
ToolNameMatchResults
Results for tool name match metric.
Fields | |
---|---|
tool_ |
Output only. Tool name match metric values. |
ToolNameMatchSpec
This type has no fields.
Spec for tool name match metric.
ToolParameterKVMatchInput
Input for tool parameter key value match metric.
Fields | |
---|---|
metric_ |
Required. Spec for tool parameter key value match metric. |
instances[] |
Required. Repeated tool parameter key value match instances. |
ToolParameterKVMatchInstance
Spec for tool parameter key value match instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Required. Ground truth used to compare against the prediction. |
ToolParameterKVMatchMetricValue
Tool parameter key value match metric value for an instance.
Fields | |
---|---|
score |
Output only. Tool parameter key value match score. |
ToolParameterKVMatchResults
Results for tool parameter key value match metric.
Fields | |
---|---|
tool_ |
Output only. Tool parameter key value match metric values. |
ToolParameterKVMatchSpec
Spec for tool parameter key value match metric.
Fields | |
---|---|
use_ |
Optional. Whether to use STRICT string match on parameter values. |
ToolParameterKeyMatchInput
Input for tool parameter key match metric.
Fields | |
---|---|
metric_ |
Required. Spec for tool parameter key match metric. |
instances[] |
Required. Repeated tool parameter key match instances. |
ToolParameterKeyMatchInstance
Spec for tool parameter key match instance.
Fields | |
---|---|
prediction |
Required. Output of the evaluated model. |
reference |
Required. Ground truth used to compare against the prediction. |
ToolParameterKeyMatchMetricValue
Tool parameter key match metric value for an instance.
Fields | |
---|---|
score |
Output only. Tool parameter key match score. |
ToolParameterKeyMatchResults
Results for tool parameter key match metric.
Fields | |
---|---|
tool_ |
Output only. Tool parameter key match metric values. |
ToolParameterKeyMatchSpec
This type has no fields.
Spec for tool parameter key match metric.
ToolUseExample
A single example of the tool usage.
Fields | |
---|---|
display_ |
Required. The display name for example. |
query |
Required. Query that should be routed to this tool. |
request_ |
Request parameters used for executing this tool. |
response_ |
Response parameters generated by this tool. |
response_ |
Summary of the tool response to the user query. |
Union field Target . Target tool to use. Target can be only one of the following: |
|
extension_ |
Extension operation to call. |
function_ |
Function name to call. |
ExtensionOperation
Identifies one operation of the extension.
Fields | |
---|---|
extension |
Resource name of the extension. |
operation_ |
Required. Operation ID of the extension. |
Trajectory
Spec for trajectory.
Fields | |
---|---|
tool_ |
Required. Tool calls in the trajectory. |
TrajectoryAnyOrderMatchInput
Instances and metric spec for TrajectoryAnyOrderMatch metric.
Fields | |
---|---|
metric_ |
Required. Spec for TrajectoryAnyOrderMatch metric. |
instances[] |
Required. Repeated TrajectoryAnyOrderMatch instance. |
TrajectoryAnyOrderMatchInstance
Spec for TrajectoryAnyOrderMatch instance.
Fields | |
---|---|
predicted_ |
Required. Spec for predicted tool call trajectory. |
reference_ |
Required. Spec for reference tool call trajectory. |
TrajectoryAnyOrderMatchMetricValue
TrajectoryAnyOrderMatch metric value for an instance.
Fields | |
---|---|
score |
Output only. TrajectoryAnyOrderMatch score. |
TrajectoryAnyOrderMatchResults
Results for TrajectoryAnyOrderMatch metric.
Fields | |
---|---|
trajectory_ |
Output only. TrajectoryAnyOrderMatch metric values. |
TrajectoryAnyOrderMatchSpec
This type has no fields.
Spec for TrajectoryAnyOrderMatch metric - returns 1 if all tool calls in the reference trajectory appear in the predicted trajectory in any order, else 0.
TrajectoryExactMatchInput
Instances and metric spec for TrajectoryExactMatch metric.
Fields | |
---|---|
metric_ |
Required. Spec for TrajectoryExactMatch metric. |
instances[] |
Required. Repeated TrajectoryExactMatch instance. |
TrajectoryExactMatchInstance
Spec for TrajectoryExactMatch instance.
Fields | |
---|---|
predicted_ |
Required. Spec for predicted tool call trajectory. |
reference_ |
Required. Spec for reference tool call trajectory. |
TrajectoryExactMatchMetricValue
TrajectoryExactMatch metric value for an instance.
Fields | |
---|---|
score |
Output only. TrajectoryExactMatch score. |
TrajectoryExactMatchResults
Results for TrajectoryExactMatch metric.
Fields | |
---|---|
trajectory_ |
Output only. TrajectoryExactMatch metric values. |
TrajectoryExactMatchSpec
This type has no fields.
Spec for TrajectoryExactMatch metric - returns 1 if tool calls in the reference trajectory exactly match the predicted trajectory, else 0.
TrajectoryInOrderMatchInput
Instances and metric spec for TrajectoryInOrderMatch metric.
Fields | |
---|---|
metric_ |
Required. Spec for TrajectoryInOrderMatch metric. |
instances[] |
Required. Repeated TrajectoryInOrderMatch instance. |
TrajectoryInOrderMatchInstance
Spec for TrajectoryInOrderMatch instance.
Fields | |
---|---|
predicted_ |
Required. Spec for predicted tool call trajectory. |
reference_ |
Required. Spec for reference tool call trajectory. |
TrajectoryInOrderMatchMetricValue
TrajectoryInOrderMatch metric value for an instance.
Fields | |
---|---|
score |
Output only. TrajectoryInOrderMatch score. |
TrajectoryInOrderMatchResults
Results for TrajectoryInOrderMatch metric.
Fields | |
---|---|
trajectory_ |
Output only. TrajectoryInOrderMatch metric values. |
TrajectoryInOrderMatchSpec
This type has no fields.
Spec for TrajectoryInOrderMatch metric - returns 1 if tool calls in the reference trajectory appear in the predicted trajectory in the same order, else 0.
TrajectoryPrecisionInput
Instances and metric spec for TrajectoryPrecision metric.
Fields | |
---|---|
metric_ |
Required. Spec for TrajectoryPrecision metric. |
instances[] |
Required. Repeated TrajectoryPrecision instance. |
TrajectoryPrecisionInstance
Spec for TrajectoryPrecision instance.
Fields | |
---|---|
predicted_ |
Required. Spec for predicted tool call trajectory. |
reference_ |
Required. Spec for reference tool call trajectory. |
TrajectoryPrecisionMetricValue
TrajectoryPrecision metric value for an instance.
Fields | |
---|---|
score |
Output only. TrajectoryPrecision score. |
TrajectoryPrecisionResults
Results for TrajectoryPrecision metric.
Fields | |
---|---|
trajectory_ |
Output only. TrajectoryPrecision metric values. |
TrajectoryPrecisionSpec
This type has no fields.
Spec for TrajectoryPrecision metric - returns a float score based on average precision of individual tool calls.
TrajectoryRecallInput
Instances and metric spec for TrajectoryRecall metric.
Fields | |
---|---|
metric_ |
Required. Spec for TrajectoryRecall metric. |
instances[] |
Required. Repeated TrajectoryRecall instance. |
TrajectoryRecallInstance
Spec for TrajectoryRecall instance.
Fields | |
---|---|
predicted_ |
Required. Spec for predicted tool call trajectory. |
reference_ |
Required. Spec for reference tool call trajectory. |
TrajectoryRecallMetricValue
TrajectoryRecall metric value for an instance.
Fields | |
---|---|
score |
Output only. TrajectoryRecall score. |
TrajectoryRecallResults
Results for TrajectoryRecall metric.
Fields | |
---|---|
trajectory_ |
Output only. TrajectoryRecall metric values. |
TrajectoryRecallSpec
This type has no fields.
Spec for TrajectoryRecall metric - returns a float score based on average recall of individual tool calls.
TrajectorySingleToolUseInput
Instances and metric spec for TrajectorySingleToolUse metric.
Fields | |
---|---|
metric_ |
Required. Spec for TrajectorySingleToolUse metric. |
instances[] |
Required. Repeated TrajectorySingleToolUse instance. |
TrajectorySingleToolUseInstance
Spec for TrajectorySingleToolUse instance.
Fields | |
---|---|
predicted_ |
Required. Spec for predicted tool call trajectory. |
TrajectorySingleToolUseMetricValue
TrajectorySingleToolUse metric value for an instance.
Fields | |
---|---|
score |
Output only. TrajectorySingleToolUse score. |
TrajectorySingleToolUseResults
Results for TrajectorySingleToolUse metric.
Fields | |
---|---|
trajectory_ |
Output only. TrajectorySingleToolUse metric values. |
TrajectorySingleToolUseSpec
Spec for TrajectorySingleToolUse metric - returns 1 if tool is present in the predicted trajectory, else 0.
Fields | |
---|---|
tool_ |
Required. Spec for tool name to be checked for in the predicted trajectory. |
TunedModel
The Model Registry Model and Online Prediction Endpoint assiociated with this TuningJob
.
Fields | |
---|---|
model |
Output only. The resource name of the TunedModel. Format: |
endpoint |
Output only. A resource name of an Endpoint. Format: |
TunedModelRef
TunedModel Reference for legacy model migration.
Fields | |
---|---|
Union field tuned_model_ref . The Tuned Model Reference for the model. tuned_model_ref can be only one of the following: |
|
tuned_ |
Support migration from model registry. |
tuning_ |
Support migration from tuning job list page, from gemini-1.0-pro-002 to 1.5 and above. |
pipeline_ |
Support migration from tuning job list page, from bison model to gemini model. |
TuningDataStats
The tuning data statistic values for TuningJob
.
Fields | |
---|---|
Union field
|
|
supervised_ |
The SFT Tuning data stats. |
distillation_ |
Output only. Statistics for distillation. |
TuningJob
Represents a TuningJob that runs with Google owned models.
Fields | |
---|---|
name |
Output only. Identifier. Resource name of a TuningJob. Format: |
tuned_ |
Optional. The display name of the |
description |
Optional. The description of the |
state |
Output only. The detailed state of the job. |
create_ |
Output only. Time when the |
start_ |
Output only. Time when the |
end_ |
Output only. Time when the TuningJob entered any of the following |
update_ |
Output only. Time when the |
error |
Output only. Only populated when job's state is |
labels |
Optional. The labels with user-defined metadata to organize Label keys and values can be no longer than 64 characters (Unicode codepoints), can only contain lowercase letters, numeric characters, underscores and dashes. International characters are allowed. See https://goo.gl/xmQnxf for more information and examples of labels. |
experiment |
Output only. The Experiment associated with this |
tuned_ |
Output only. The tuned model resources assiociated with this |
tuning_ |
Output only. The tuning data statistics associated with this |
pipeline_job |
Output only. The resource name of the PipelineJob associated with the |
encryption_ |
Customer-managed encryption key options for a TuningJob. If this is set, then all resources created by the TuningJob will be encrypted with the provided encryption key. |
service_ |
The service account that the tuningJob workload runs as. If not specified, the Vertex AI Secure Fine-Tuned Service Agent in the project will be used. See https://cloud.google.com/iam/docs/service-agents#vertex-ai-secure-fine-tuning-service-agent Users starting the pipeline must have the |
Union field
|
|
base_ |
The base model that is being tuned, e.g., "gemini-1.0-pro-002". . |
Union field
|
|
supervised_ |
Tuning Spec for Supervised Fine Tuning. |
distillation_ |
Tuning Spec for Distillation. |
partner_ |
Tuning Spec for open sourced and third party Partner models. |
Type
Type contains the list of OpenAPI data types as defined by https://swagger.io/docs/specification/data-models/data-types/
Enums | |
---|---|
TYPE_UNSPECIFIED |
Not specified, should not be used. |
STRING |
OpenAPI string type |
NUMBER |
OpenAPI number type |
INTEGER |
OpenAPI integer type |
BOOLEAN |
OpenAPI boolean type |
ARRAY |
OpenAPI array type |
OBJECT |
OpenAPI object type |
UpdateCacheConfigRequest
Request message for updating a cache config.
Fields | |
---|---|
cache_ |
Required. The cache config to be updated. |
UpdateCachedContentRequest
Request message for GenAiCacheService.UpdateCachedContent
. Only expire_time or ttl can be updated.
Fields | |
---|---|
cached_ |
Required. The cached content to update |
update_ |
Required. The list of fields to update. |
UpdateExtensionRequest
Request message for ExtensionRegistryService.UpdateExtension
.
Fields | |
---|---|
extension |
Required. The Extension which replaces the resource on the server. |
update_ |
Required. Mask specifying which fields to update. Supported fields:
|
UpdateRagCorpusOperationMetadata
Runtime operation information for VertexRagDataService.UpdateRagCorpus
.
Fields | |
---|---|
generic_ |
The operation generic information. |
UpdateRagCorpusRequest
Request message for VertexRagDataService.UpdateRagCorpus
.
Fields | |
---|---|
rag_ |
Required. The RagCorpus which replaces the resource on the server. |
UpdateReasoningEngineOperationMetadata
Details of ReasoningEngineService.UpdateReasoningEngine
operation.
Fields | |
---|---|
generic_ |
The common part of the operation metadata. |
UpdateReasoningEngineRequest
Request message for ReasoningEngineService.UpdateReasoningEngine
.
Fields | |
---|---|
reasoning_ |
Required. The ReasoningEngine which replaces the resource on the server. |
update_ |
Optional. Mask specifying which fields to update. |
UploadRagFileConfig
Config for uploading RagFile.
Fields | |
---|---|
rag_file_chunking_config |
Specifies the size and overlap of chunks after uploading RagFile. |
rag_ |
Specifies the transformation config for RagFiles. |
VertexAISearch
Retrieve from Vertex AI Search datastore for grounding. See https://cloud.google.com/products/agent-builder
Fields | |
---|---|
datastore |
Required. Fully-qualified Vertex AI Search data store resource ID. Format: |
VertexAiSearchConfig
Config for the Vertex AI Search.
Fields | |
---|---|
serving_ |
Vertex AI Search Serving Config resource full name. For example, |
VertexRagStore
Retrieve from Vertex RAG Store for grounding.
Fields | |
---|---|
rag_corpora[] |
Optional. Deprecated. Please use rag_resources instead. |
rag_ |
Optional. The representation of the rag source. It can be used to specify corpus only or ragfiles. Currently only support one corpus or multiple files from one corpus. In the future we may open up multiple corpora support. |
rag_ |
Optional. The retrieval config for the Rag query. |
similarity_top_k |
Optional. Number of top k results to return from the selected corpora. |
vector_distance_threshold |
Optional. Only return results with vector distance smaller than the threshold. |
RagResource
The definition of the Rag resource.
Fields | |
---|---|
rag_ |
Optional. RagCorpora resource name. Format: |
rag_ |
Optional. rag_file_id. The files should be in the same rag_corpus set in rag_corpus field. |
VideoMetadata
Metadata describes the input video content.
Fields | |
---|---|
start_ |
Optional. The start offset of the video. |
end_ |
Optional. The end offset of the video. |
VoiceConfig
The configuration for the voice to use.
Fields | |
---|---|
Union field voice_config . The configuration for the speaker to use. voice_config can be only one of the following: |
|
prebuilt_ |
The configuration for the prebuilt voice to use. |