Class PredictionServiceGrpc.PredictionServiceFutureStub (3.41.0)

public static final class PredictionServiceGrpc.PredictionServiceFutureStub extends AbstractFutureStub<PredictionServiceGrpc.PredictionServiceFutureStub>

A stub that lets clients make ListenableFuture-style RPC calls to the PredictionService service.

A service for online predictions and explanations.
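
Because ListenableFuture extends java.util.concurrent.Future, the futures returned by this stub can be consumed with standard-library code. The following is a minimal sketch, not part of the generated API: FutureResults and awaitOrCancel are hypothetical names, and cancelling on timeout is an assumption about what a caller might want.

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

// Hypothetical helper; not part of the generated gRPC classes.
final class FutureResults {
    private FutureResults() {}

    /**
     * Blocks for at most timeoutMillis waiting for the result of a future-style call.
     * If the wait times out, the future is cancelled before the exception is rethrown.
     */
    static <T> T awaitOrCancel(Future<T> future, long timeoutMillis)
            throws InterruptedException, ExecutionException, TimeoutException {
        try {
            return future.get(timeoutMillis, TimeUnit.MILLISECONDS);
        } catch (TimeoutException e) {
            // Assumption: a caller that gives up waiting no longer wants the result.
            future.cancel(true);
            throw e;
        }
    }
}
```

With a stub instance, a call might look like `PredictResponse response = FutureResults.awaitOrCancel(stub.predict(request), 5_000);`, where `stub` and `request` are assumed to exist already. For fully asynchronous handling, Guava's Futures.addCallback can be used instead of blocking.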

Inheritance

java.lang.Object > io.grpc.stub.AbstractStub > io.grpc.stub.AbstractFutureStub > PredictionServiceGrpc.PredictionServiceFutureStub

Methods

build(Channel channel, CallOptions callOptions)

protected PredictionServiceGrpc.PredictionServiceFutureStub build(Channel channel, CallOptions callOptions)

Parameters
Name         Description
channel      io.grpc.Channel
callOptions  io.grpc.CallOptions

Returns
Type  Description
PredictionServiceGrpc.PredictionServiceFutureStub

Overrides
io.grpc.stub.AbstractStub.build(io.grpc.Channel,io.grpc.CallOptions)

countTokens(CountTokensRequest request)

public ListenableFuture<CountTokensResponse> countTokens(CountTokensRequest request)

Perform token counting.

Parameter
Name     Description
request  CountTokensRequest

Returns
Type  Description
com.google.common.util.concurrent.ListenableFuture<CountTokensResponse>

directPredict(DirectPredictRequest request)

public ListenableFuture<DirectPredictResponse> directPredict(DirectPredictRequest request)

Perform a unary online prediction request to a gRPC model server for Vertex first-party products and frameworks.

Parameter
Name     Description
request  DirectPredictRequest

Returns
Type  Description
com.google.common.util.concurrent.ListenableFuture<DirectPredictResponse>

directRawPredict(DirectRawPredictRequest request)

public ListenableFuture<DirectRawPredictResponse> directRawPredict(DirectRawPredictRequest request)

Perform a unary online prediction request to a gRPC model server for custom containers.

Parameter
Name     Description
request  DirectRawPredictRequest

Returns
Type  Description
com.google.common.util.concurrent.ListenableFuture<DirectRawPredictResponse>

explain(ExplainRequest request)

public ListenableFuture<ExplainResponse> explain(ExplainRequest request)

Perform an online explanation. If deployed_model_id is specified, the corresponding DeployedModel must have explanation_spec populated. If deployed_model_id is not specified, all DeployedModels must have explanation_spec populated.

Parameter
Name     Description
request  ExplainRequest

Returns
Type  Description
com.google.common.util.concurrent.ListenableFuture<ExplainResponse>
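
The explanation_spec precondition described for explain can be sketched as a small client-side check. This is an illustrative model only, not the service's implementation: ExplainPrecondition, DeployedModelInfo, and canExplain are hypothetical names, and treating an empty deployed_model_id string as "not specified" is an assumption based on how unset protobuf string fields usually behave.

```java
import java.util.List;

// Hypothetical, simplified model of the precondition; not the service's implementation.
final class ExplainPrecondition {
    private ExplainPrecondition() {}

    /** Minimal stand-in for a DeployedModel: its ID and whether explanation_spec is populated. */
    static final class DeployedModelInfo {
        final String id;
        final boolean hasExplanationSpec;

        DeployedModelInfo(String id, boolean hasExplanationSpec) {
            this.id = id;
            this.hasExplanationSpec = hasExplanationSpec;
        }
    }

    /**
     * Returns true when the documented precondition holds for the given endpoint contents.
     * A null or empty deployedModelId is treated as "not specified" (an assumption).
     */
    static boolean canExplain(List<DeployedModelInfo> models, String deployedModelId) {
        if (deployedModelId != null && !deployedModelId.isEmpty()) {
            // ID specified: only the matching DeployedModel needs explanation_spec.
            for (DeployedModelInfo model : models) {
                if (model.id.equals(deployedModelId)) {
                    return model.hasExplanationSpec;
                }
            }
            return false; // No DeployedModel with that ID.
        }
        // ID not specified: every DeployedModel needs explanation_spec.
        // Note: an empty list passes vacuously; the source does not say how the service treats it.
        for (DeployedModelInfo model : models) {
            if (!model.hasExplanationSpec) {
                return false;
            }
        }
        return true;
    }
}
```

A check like this could run before calling explain to fail fast, although the service remains the authority on whether a request is accepted.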

generateContent(GenerateContentRequest request)

public ListenableFuture<GenerateContentResponse> generateContent(GenerateContentRequest request)

Generate content with multimodal inputs.

Parameter
Name     Description
request  GenerateContentRequest

Returns
Type  Description
com.google.common.util.concurrent.ListenableFuture<GenerateContentResponse>

predict(PredictRequest request)

public ListenableFuture<PredictResponse> predict(PredictRequest request)

Perform an online prediction.

Parameter
Name     Description
request  PredictRequest

Returns
Type  Description
com.google.common.util.concurrent.ListenableFuture<PredictResponse>

rawPredict(RawPredictRequest request)

public ListenableFuture<HttpBody> rawPredict(RawPredictRequest request)

Perform an online prediction with an arbitrary HTTP payload. The response includes the following HTTP headers:

  • X-Vertex-AI-Endpoint-Id: ID of the Endpoint that served this prediction.
  • X-Vertex-AI-Deployed-Model-Id: ID of the Endpoint's DeployedModel that served this prediction.

Parameter
Name     Description
request  RawPredictRequest

Returns
Type  Description
com.google.common.util.concurrent.ListenableFuture<com.google.api.HttpBody>
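
The two response headers listed for rawPredict can be read with a case-insensitive lookup, since HTTP header names are case-insensitive. This sketch assumes the headers have already been collected into a map of names to value lists; how they are obtained from the transport is not covered in this reference. VertexResponseHeaders and firstValue are hypothetical names.

```java
import java.util.List;
import java.util.Map;

// Hypothetical helper; the header names come from the rawPredict description above.
final class VertexResponseHeaders {
    static final String ENDPOINT_ID = "X-Vertex-AI-Endpoint-Id";
    static final String DEPLOYED_MODEL_ID = "X-Vertex-AI-Deployed-Model-Id";

    private VertexResponseHeaders() {}

    /**
     * Returns the first value of the named header, matching the name case-insensitively,
     * or null if the header is absent or has no values.
     */
    static String firstValue(Map<String, List<String>> headers, String name) {
        for (Map.Entry<String, List<String>> entry : headers.entrySet()) {
            String key = entry.getKey();
            List<String> values = entry.getValue();
            // Some header maps contain a null key (for example, for a status line); skip it.
            if (key != null && key.equalsIgnoreCase(name) && values != null && !values.isEmpty()) {
                return values.get(0);
            }
        }
        return null;
    }
}
```

For example, `VertexResponseHeaders.firstValue(headers, VertexResponseHeaders.DEPLOYED_MODEL_ID)` would return the ID of the DeployedModel that served the prediction when that header is present.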