Interface PredictionServiceGrpc.AsyncService (3.29.0)
Stay organized with collections Save and categorize content based on your preferences.

public static interface PredictionServiceGrpc.AsyncService

A service for online predictions and explanations.

Methods

countTokens(CountTokensRequest request, StreamObserver<CountTokensResponse> responseObserver)

public default void countTokens(CountTokensRequest request, StreamObserver<CountTokensResponse> responseObserver)

Perform a token counting.

Parameters
Name	Description
`request`	`CountTokensRequest`
`responseObserver`	`io.grpc.stub.StreamObserver<CountTokensResponse>`

explain(ExplainRequest request, StreamObserver<ExplainResponse> responseObserver)

public default void explain(ExplainRequest request, StreamObserver<ExplainResponse> responseObserver)

Perform an online explanation. If deployed_model_id is specified, the corresponding DeployModel must have explanation_spec populated. If deployed_model_id is not specified, all DeployedModels must have explanation_spec populated.

Parameters
Name	Description
`request`	`ExplainRequest`
`responseObserver`	`io.grpc.stub.StreamObserver<ExplainResponse>`

predict(PredictRequest request, StreamObserver<PredictResponse> responseObserver)

public default void predict(PredictRequest request, StreamObserver<PredictResponse> responseObserver)

Perform an online prediction.

Parameters
Name	Description
`request`	`PredictRequest`
`responseObserver`	`io.grpc.stub.StreamObserver<PredictResponse>`

rawPredict(RawPredictRequest request, StreamObserver<HttpBody> responseObserver)

public default void rawPredict(RawPredictRequest request, StreamObserver<HttpBody> responseObserver)

Perform an online prediction with an arbitrary HTTP payload. The response includes the following HTTP headers:

X-Vertex-AI-Endpoint-Id: ID of the Endpoint that served this prediction.
X-Vertex-AI-Deployed-Model-Id: ID of the Endpoint's DeployedModel that served this prediction.

Parameters
Name	Description
`request`	`RawPredictRequest`
`responseObserver`	`io.grpc.stub.StreamObserver<com.google.api.HttpBody>`

serverStreamingPredict(StreamingPredictRequest request, StreamObserver<StreamingPredictResponse> responseObserver)

public default void serverStreamingPredict(StreamingPredictRequest request, StreamObserver<StreamingPredictResponse> responseObserver)

Perform a server-side streaming online prediction request for Vertex LLM streaming.

Parameters
Name	Description
`request`	`StreamingPredictRequest`
`responseObserver`	`io.grpc.stub.StreamObserver<StreamingPredictResponse>`

Interface PredictionServiceGrpc.AsyncService (3.29.0) Stay organized with collections Save and categorize content based on your preferences.