Package @google-cloud/vertexai (0.3.1)

Classes

ChatSession

Chat session to make multi-turn send message request. sendMessage method makes async call to get response of a chat message. sendMessageStream method makes async call to stream response of a chat message.

ClientError

ClientError is thrown when http 4XX status is received. For details please refer to https://developer.mozilla.org/en-US/docs/Web/HTTP/Status#client_error_responses

GenerativeModel

Base class for generative models. NOTE: this class should not be instantiated directly. Use vertexai.preview.getGenerativeModel() instead.

GoogleAuthError

GoogleAuthError is thrown when there is authentication issue with the request

GoogleGenerativeAIError

GoogleGenerativeAIError is thrown when http response is not ok and status code is not 4XX For details please refer to https://developer.mozilla.org/en-US/docs/Web/HTTP/Status

VertexAI_Preview

VertexAI class internal implementation for authentication.

VertexAI

Base class for authenticating to Vertex, creates the preview namespace.

Interfaces

BaseModelParams

Base params for initializing a model or calling GenerateContent. {SafetySetting[]} - [safety_settings] Array of SafetySetting {GenerationConfig} - [generation_config] GenerationConfig

BasePart

A part of a turn in a conversation with the model with a fixed MIME type. It has one of the following mutually exclusive fields: 1. text 2. inline_data 3. file_data 4. functionResponse 5. functionCall

CitationMetadata

A collection of source attributions for a piece of content. {CitationSource[]} - citationSources. Array of CitationSource

CitationSource

Source attributions for content. {number} - [startIndex] Start index into the content. {number} - [endIndex] End index into the content. {string} - [url] Url reference of the attribution. {string} - [license] License of the attribution.

Content

The base structured datatype containing multi-part content of a message. {Part[]} - parts. Array of Part {string} - [role]. The producer of the content. Must be either 'user' or 'model'. Useful to set for multi-turn conversations, otherwise can be left blank or unset.

CountTokensRequest

Params used to call the countTokens method. {Content[]} - contents. Array of Content

CountTokensResponse

Response returned from countTokens method. {number} - totalTokens. The total number of tokens counted across all instances from the request. {number} - [totalBillableCharacters]. The total number of billable characters counted across all instances from the request.

FileData

URI based data. {string} - mime_type. The IANA standard MIME type of the source data. {string} - file_uri. URI of the file.

FileDataPart

A file data part of a conversation with the model. {never} - [text]. text is not expected for FileDataPart. {never} - [inline_data]. inline_data is not expected for FileDataPart. {FileData} - file_data. Only this property is expected for FileDataPart. FileData {never} - [functionResponse]. functionResponse is not expected for FileDataPart. {never} - [functionCall]. functionCall is not expected for FileDataPart.

FunctionCall

A predicted FunctionCall returned from the model that contains a string representating the FunctionDeclaration.name with the parameters and their values. {string} - name The name of the function specified in FunctionDeclaration.name. {object} - args The arguments to pass to the function.

FunctionCallPart

A function call part of a conversation with the model. {never} - [text]. text is not expected for FunctionResponsePart. {never} - [inline_data]. inline_data is not expected for FunctionResponsePart. {never} - [file_data]. file_data is not expected for FunctionResponsePart. FileData {never} - [functionResponse]. functionResponse is not expected for FunctionResponsePart. {FunctionCall} - functionCall. only functionCall is expected for FunctionCallPart.

FunctionDeclaration

Structured representation of a function declaration as defined by the [OpenAPI 3.0 specification](https://spec.openapis.org/oas/v3.0.3). Included in this declaration are the function name and parameters. This FunctionDeclaration is a representation of a block of code that can be used as a Tool by the model and executed by the client. {string} - name The name of the function to call. Must start with a letter or an underscore. Must be a-z, A-Z, 0-9, or contain underscores and dashes, with a max length of 64. {string} - description Description and purpose of the function. Model uses it to decide how and whether to call the function. {FunctionDeclarationSchema} - parameters Describes the parameters to this function in JSON Schema Object format. Reflects the Open API 3.03 Parameter Object. string Key: the name of the parameter. Parameter names are case sensitive. Schema Value: the Schema defining the type used for the parameter. For function with no parameters, this can be left unset. Example with 1 required and 1 optional parameter: type: OBJECT properties:

param1:

type: STRING param2:

type: INTEGER required:

  • param1

FunctionDeclarationSchema

Schema for parameters passed to [FunctionDeclaration.parameters]

FunctionDeclarationSchemaProperty

Schema is used to define the format of input/output data. Represents a select subset of an OpenAPI 3.0 schema object. More fields may be added in the future as needed.

FunctionResponse

The result output of a FunctionCall that contains a string representing the FunctionDeclaration.name and a structured JSON object containing any output from the function call. It is used as context to the model. {string} - name The name of the function specified in FunctionDeclaration.name. {object} - response The expected response from the model.

FunctionResponsePart

A function response part of a conversation with the model. {never} - [text]. text is not expected for FunctionResponsePart. {never} - [inline_data]. inline_data is not expected for FunctionResponsePart. {FileData} - [file_data]. file_data is not expected for FunctionResponsePart. FileData {never} - functionResponse. only functionResponse is expected for FunctionResponsePart. {never} - [functionCall]. functionCall is not expected for FunctionResponsePart.

GenerateContentCandidate

A response candidate generated from the model. {Content} - content. Content {number} - [index]. The index of the candidate in the GenerateContentResponse {FinishReason} - [finishReason]. FinishReason {string} - [finishMessage]. {SafetyRating[]} - [safetyRatings]. Array of SafetyRating {CitationMetadata} - [citationMetadata]. CitationMetadata

GenerateContentRequest

Params used to call the generateContent method. {Content[]} - contents. Array of Content

GenerateContentResponse

Response from the model supporting multiple candidates {GenerateContentCandidate} - candidates. GenerateContentCandidate {PromptFeedback} - [promptFeedback]. This is only populated if there are no candidates due to a safety block PromptFeedback {UsageMetadata} - [usageMetadata]. UsageMetadata

GenerateContentResult

Wrapper for respones from a generateContent request {GenerateContentResponse} - response. All GenerateContentResponses received so far GenerateContentResponse

GenerationConfig

Configuration options for model generation and outputs {number} - [candidate_count] Number of candidates to generate. {string[]} - [stop_sequences] Stop sequences. {number} - [max_output_tokens] The maximum number of output tokens to generate per message. {number} - [temperature] Controls the randomness of predictions. {number} - [top_p] If specified, nucleus sampling will be used. {number} - [top_k] If specified, top-k sampling will be used.

GenerativeContentBlob

Raw media bytes sent directly in the request. Text should not be sent as raw bytes. {string} - mime_type. The MIME type of the source data. The only accepted values: "image/png" or "image/jpeg". {string} - data. data must be base64 string

GetGenerativeModelParams

{string} model - model name {string} project - project The Google Cloud project to use for the request {string} location - The Google Cloud project location to use for the request {GoogleAuth} googleAuth - GoogleAuth class instance that handles authentication. Details about GoogleAuth is referred to https://github.com/googleapis/google-auth-library-nodejs/blob/main/src/auth/googleauth.ts {string} - [apiEndpoint] The base Vertex AI endpoint to use for the request. If not provided, the default regionalized endpoint (i.e. us-central1-aiplatform.googleapis.com) will be used. {GenerationConfig} [generation_config] - GenerationConfig {SafetySetting[]} [safety_settings] - SafetySetting {Tool[]} [tools] - Tool

InlineDataPart

An inline data part of a conversation with the model. {never} - [text]. text is not expected for InlineDataPart. {GenerativeContentBlob} - inline_data. Only this property is expected for InlineDataPart. GenerativeContentBlob {never} - [file_data]. file_data is not expected for InlineDataPart. {never} - [functionResponse]. functionResponse is not expected for InlineDataPart. {never} - [functionCall]. functionCall is not expected for InlineDataPart.

ModelParams

Configuration for initializing a model, for example via getGenerativeModel {string} model - model name.

PromptFeedback

Content filter results for a prompt sent in the request. {BlockedReason} - block_reason. {SafetyRating[]} - safety_ratings. Array of SafetyRating {string} - block_reason_message. A readable block reason message.

SafetyRating

Safety rating corresponding to the generated content. {HarmCategory} - category. HarmCategory {HarmProbability} - probability. HarmProbability

SafetySetting

Safety feedback for an entire request. {HarmCategory} - category. HarmCategory {HarmBlockThreshold} - threshold. HarmBlockThreshold

StartChatParams

Params to initiate a multiturn chat with the model via startChat {Content[]} - [history] history of the chat session. Content {SafetySetting[]} - [safety_settings] Array of SafetySetting {GenerationConfig} - [generation_config] GenerationConfig

StartChatSessionRequest

All params passed to initiate multiturn chat via startChat {VertexAI_Preview} - _vertex_instance VertexAI_Preview {GenerativeModel} - _model_instance GenerativeModel

StreamGenerateContentResult

Wrapper for respones from a generateContent method when steam parameter is true {Promise

TextPart

A text part of a conversation with the model. {string} - text. Only this propery is expected for TextPart. {never} - [inline_data]. inline_data is not expected for TextPart. {never} - [file_data]. file_data is not expected for TextPart. {never} - [functionResponse]. functionResponse is not expected for TextPart. {never} - [functionCall]. functionCall is not expected for TextPart.

Tool

A Tool is a piece of code that enables the system to interact with external systems to perform an action, or set of actions, outside of knowledge and scope of the model. {object} - function_declarations One or more function declarations to be passed to the model along with the current user query. Model may decide to call a subset of these functions by populating [FunctionCall][content.part.function_call] in the response. User should provide a [FunctionResponse][content.part.function_response] for each function call in the next turn. Based on the function responses, Model will generate the final response back to the user. Maximum 64 function declarations can be provided.

UsageMetadata

Usage metadata about response(s). {number} - [prompt_token_count]. Number of tokens in the request. {number} - [candidates_token_count]. Number of tokens in the response(s). {number} - [totalTokenCount]. Total number of tokens.

VertexInit

Params used to initialize the Vertex SDK @param{string} project - the project name of your Google Cloud project. It is not the numeric project ID. @param{string} location - the location of your project. @param{string} [apiEndpoint] - If not specified, a default value will be resolved by SDK.

Enums

BlockedReason

{string} The reason why the reponse is blocked. Values: BLOCKED_REASON_UNSPECIFIED Unspecified blocked reason. SAFETY Candidates blocked due to safety. OTHER Candidates blocked due to other reason.

FinishReason

{string} The reason why the model stopped generating tokens. If empty, the model has not stopped generating the tokens. Values: FINISH_REASON_UNSPECIFIED The finish reason is unspecified. STOP: Natural stop point of the model or provided stop sequence. MAX_TOKENS: The maximum number of tokens as specified in the request was reached. SAFETY: The token generation was stopped as the response was flagged for safety reasons. NOTE: When streaming the Candidate.content will be empty if content filters blocked the output. RECITATION: The token generation was stopped as the response was flagged for unauthorized citations. OTHER: All other reasons that stopped the token generation

FunctionDeclarationSchemaType

Contains the list of OpenAPI data types as defined by https://swagger.io/docs/specification/data-models/data-types/

HarmBlockThreshold

{string} Probability based thresholds levels for blocking. Values: HARM_BLOCK_THRESHOLD_UNSPECIFIED: Unspecified harm block threshold. BLOCK_LOW_AND_ABOVE: Block low threshold and above (i.e. block more). BLOCK_MEDIUM_AND_ABOVE: Block medium threshold and above. BLOCK_ONLY_HIGH: Block only high threshold (i.e. block less). BLOCK_NONE: Block none.

HarmCategory

{string} Harm categories that will block the content. Values: HARM_CATEGORY_UNSPECIFIED: The harm category is unspecified. HARM_CATEGORY_HATE_SPEECH: The harm category is hate speech. HARM_CATEGORY_DANGEROUS_CONTENT: The harm category is dangerous content. HARM_CATEGORY_HARASSMENT: The harm category is harassment. HARM_CATEGORY_SEXUALLY_EXPLICIT: The harm category is sexually explicit content.

HarmProbability

{string} Harm probability levels in the content. Values: HARM_PROBABILITY_UNSPECIFIED: Harm probability unspecified. NEGLIGIBLE: Negligible level of harm. LOW: Low level of harm. MEDIUM: Medium level of harm. HIGH: High level of harm.

Type Aliases

Part

export declare type Part = TextPart | InlineDataPart | FileDataPart | FunctionResponsePart | FunctionCallPart;

A datatype containing media that is part of a multi-part Content message. A Part is a union type of TextPart, InlineDataPart, FileDataPart, and FunctionResponsePart. A Part has one of the following mutually exclusive fields: 1. text 2. inline_data 3. file_data 4. functionResponse