Class PredictionServiceGrpc.PredictionServiceBlockingStub (3.42.0)

public static final class PredictionServiceGrpc.PredictionServiceBlockingStub extends AbstractBlockingStub<PredictionServiceGrpc.PredictionServiceBlockingStub>

A stub that lets clients make synchronous RPC calls to the PredictionService service.

A service for online predictions and explanations.

Inheritance

java.lang.Object > io.grpc.stub.AbstractStub > io.grpc.stub.AbstractBlockingStub > PredictionServiceGrpc.PredictionServiceBlockingStub

Methods

build(Channel channel, CallOptions callOptions)

protected PredictionServiceGrpc.PredictionServiceBlockingStub build(Channel channel, CallOptions callOptions)
Parameters
Name Description
channel io.grpc.Channel
callOptions io.grpc.CallOptions
Returns
Type Description
PredictionServiceGrpc.PredictionServiceBlockingStub
Overrides
io.grpc.stub.AbstractStub.build(io.grpc.Channel,io.grpc.CallOptions)

countTokens(CountTokensRequest request)

public CountTokensResponse countTokens(CountTokensRequest request)

Perform token counting.

Parameter
Name Description
request CountTokensRequest
Returns
Type Description
CountTokensResponse

directPredict(DirectPredictRequest request)

public DirectPredictResponse directPredict(DirectPredictRequest request)

Perform a unary online prediction request to a gRPC model server for Vertex first-party products and frameworks.

Parameter
Name Description
request DirectPredictRequest
Returns
Type Description
DirectPredictResponse

directRawPredict(DirectRawPredictRequest request)

public DirectRawPredictResponse directRawPredict(DirectRawPredictRequest request)

Perform a unary online prediction request to a gRPC model server for custom containers.

Parameter
Name Description
request DirectRawPredictRequest
Returns
Type Description
DirectRawPredictResponse

explain(ExplainRequest request)

public ExplainResponse explain(ExplainRequest request)

Perform an online explanation. If deployed_model_id is specified, the corresponding DeployedModel must have explanation_spec populated. If deployed_model_id is not specified, all DeployedModels must have explanation_spec populated.

Parameter
Name Description
request ExplainRequest
Returns
Type Description
ExplainResponse
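
The explanation_spec rule described above can be mirrored as a small client-side check. This is only a sketch under stated assumptions: the class ExplainPreconditionSketch, the method canExplain, and the map of deployed model IDs to "explanation_spec populated" flags are hypothetical stand-ins, not part of the library. How the service treats an unknown deployed_model_id is not stated here, so the sketch simply reports false for it.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper that mirrors the documented explanation_spec rule. */
final class ExplainPreconditionSketch {

    /**
     * @param deployedModelId the deployed_model_id from the request; null or empty means "not specified"
     * @param specPopulated   hypothetical map: deployed model ID -> whether explanation_spec is populated
     * @return true if the documented precondition for explain holds
     */
    static boolean canExplain(String deployedModelId, Map<String, Boolean> specPopulated) {
        if (deployedModelId != null && !deployedModelId.isEmpty()) {
            // A specific model was requested: only that model needs explanation_spec.
            // An ID missing from the map yields false (assumption, see lead-in).
            return Boolean.TRUE.equals(specPopulated.get(deployedModelId));
        }
        // No model requested: every deployed model needs explanation_spec.
        // An empty map passes vacuously in this sketch.
        for (Boolean populated : specPopulated.values()) {
            if (!Boolean.TRUE.equals(populated)) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        Map<String, Boolean> models = new LinkedHashMap<>();
        models.put("model-a", true);
        models.put("model-b", false);
        System.out.println(canExplain("model-a", models)); // only model-a is checked
        System.out.println(canExplain("", models));        // all models are checked
    }
}
```

The service enforces this rule itself; a local check like this would at most let a caller fail fast before sending the request.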

generateContent(GenerateContentRequest request)

public GenerateContentResponse generateContent(GenerateContentRequest request)

Generate content with multimodal inputs.

Parameter
Name Description
request GenerateContentRequest
Returns
Type Description
GenerateContentResponse

predict(PredictRequest request)

public PredictResponse predict(PredictRequest request)

Perform an online prediction.

Parameter
Name Description
request PredictRequest
Returns
Type Description
PredictResponse

rawPredict(RawPredictRequest request)

public HttpBody rawPredict(RawPredictRequest request)

Perform an online prediction with an arbitrary HTTP payload. The response includes the following HTTP headers:

  • X-Vertex-AI-Endpoint-Id: ID of the Endpoint that served this prediction.
  • X-Vertex-AI-Deployed-Model-Id: ID of the Endpoint's DeployedModel that served this prediction.

Parameter
Name Description
request RawPredictRequest
Returns
Type Description
com.google.api.HttpBody
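
As an illustration of working with the two response headers named above, here is a minimal sketch that looks them up without regard to case. It assumes the headers have already been captured into a plain Map by some other means; how to obtain them from the stub is not covered on this page. The class RawPredictHeadersSketch and its method are hypothetical and not part of the library.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;
import java.util.TreeMap;

/** Hypothetical helper for reading the documented rawPredict response headers. */
final class RawPredictHeadersSketch {

    // Header names as listed in the rawPredict description.
    static final String ENDPOINT_ID = "X-Vertex-AI-Endpoint-Id";
    static final String DEPLOYED_MODEL_ID = "X-Vertex-AI-Deployed-Model-Id";

    /**
     * Looks up a header by name, ignoring case, because HTTP header names are
     * case-insensitive and gRPC metadata keys are lowercase.
     */
    static Optional<String> header(Map<String, String> headers, String name) {
        Map<String, String> caseInsensitive = new TreeMap<>(String.CASE_INSENSITIVE_ORDER);
        caseInsensitive.putAll(headers);
        return Optional.ofNullable(caseInsensitive.get(name));
    }

    public static void main(String[] args) {
        // Assumed input: headers captured elsewhere, here with lowercase keys.
        Map<String, String> captured = new HashMap<>();
        captured.put("x-vertex-ai-endpoint-id", "1234567890");
        System.out.println(header(captured, ENDPOINT_ID).orElse("(absent)"));
        System.out.println(header(captured, DEPLOYED_MODEL_ID).orElse("(absent)"));
    }
}
```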

serverStreamingPredict(StreamingPredictRequest request)

public Iterator<StreamingPredictResponse> serverStreamingPredict(StreamingPredictRequest request)

Perform a server-side streaming online prediction request for Vertex LLM streaming.

Parameter
Name Description
request StreamingPredictRequest
Returns
Type Description
Iterator<StreamingPredictResponse>

streamGenerateContent(GenerateContentRequest request)

public Iterator<GenerateContentResponse> streamGenerateContent(GenerateContentRequest request)

Generate content with multimodal inputs with streaming support.

Parameter
Name Description
request GenerateContentRequest
Returns
Type Description
Iterator<GenerateContentResponse>
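
Both streaming methods, serverStreamingPredict and streamGenerateContent, return an Iterator rather than a single response. The sketch below shows one way such an iterator might be consumed. To stay self-contained it uses a hypothetical Chunk class in place of the real response types and an in-memory list in place of the server stream. With a real blocking stub, each hasNext() call may block until the next message arrives or the stream ends.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

/** Sketch: draining the Iterator returned by a blocking server-streaming call. */
final class StreamingIteratorSketch {

    /** Hypothetical stand-in for a streamed response message; not a library type. */
    static final class Chunk {
        final String text;

        Chunk(String text) {
            this.text = text;
        }
    }

    /** Reads every chunk in arrival order and concatenates the text. */
    static String collectText(Iterator<Chunk> responses) {
        StringBuilder out = new StringBuilder();
        // With a real stub, hasNext() may block while waiting for the server.
        while (responses.hasNext()) {
            out.append(responses.next().text);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // An in-memory list stands in for the server stream.
        List<Chunk> fakeStream = Arrays.asList(new Chunk("Hello, "), new Chunk("world"));
        System.out.println(collectText(fakeStream.iterator()));
    }
}
```

Processing each element inside the loop, rather than collecting everything first, lets a caller act on partial output as it arrives.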