public static final class PredictionServiceGrpc.PredictionServiceFutureStub extends AbstractFutureStub<PredictionServiceGrpc.PredictionServiceFutureStub>
A stub to allow clients to do ListenableFuture-style rpc calls to service PredictionService.
A service for online predictions and explanations.
Inheritance
java.lang.Object >
io.grpc.stub.AbstractStub >
io.grpc.stub.AbstractFutureStub >
PredictionServiceGrpc.PredictionServiceFutureStub
Inherited Members
io.grpc.stub.AbstractFutureStub.<T>newStub(io.grpc.stub.AbstractStub.StubFactory<T>,io.grpc.Channel)
io.grpc.stub.AbstractFutureStub.<T>newStub(io.grpc.stub.AbstractStub.StubFactory<T>,io.grpc.Channel,io.grpc.CallOptions)
io.grpc.stub.AbstractStub.<T>withOption(io.grpc.CallOptions.Key<T>,T)
io.grpc.stub.AbstractStub.build(io.grpc.Channel,io.grpc.CallOptions)
io.grpc.stub.AbstractStub.getCallOptions()
io.grpc.stub.AbstractStub.getChannel()
io.grpc.stub.AbstractStub.withCallCredentials(io.grpc.CallCredentials)
io.grpc.stub.AbstractStub.withChannel(io.grpc.Channel)
io.grpc.stub.AbstractStub.withCompression(java.lang.String)
io.grpc.stub.AbstractStub.withDeadline(io.grpc.Deadline)
io.grpc.stub.AbstractStub.withDeadlineAfter(long,java.util.concurrent.TimeUnit)
io.grpc.stub.AbstractStub.withExecutor(java.util.concurrent.Executor)
io.grpc.stub.AbstractStub.withInterceptors(io.grpc.ClientInterceptor...)
io.grpc.stub.AbstractStub.withMaxInboundMessageSize(int)
io.grpc.stub.AbstractStub.withMaxOutboundMessageSize(int)
io.grpc.stub.AbstractStub.withOnReadyThreshold(int)
io.grpc.stub.AbstractStub.withWaitForReady()
Methods
protected PredictionServiceGrpc.PredictionServiceFutureStub build(Channel channel, CallOptions callOptions)
Parameters |
Name |
Description |
channel |
io.grpc.Channel
|
callOptions |
io.grpc.CallOptions
|
Overrides
io.grpc.stub.AbstractStub.build(io.grpc.Channel,io.grpc.CallOptions)
public ListenableFuture<CountTokensResponse> countTokens(CountTokensRequest request)
Perform a token counting.
Returns |
Type |
Description |
com.google.common.util.concurrent.ListenableFuture<CountTokensResponse> |
|
public ListenableFuture<DirectPredictResponse> directPredict(DirectPredictRequest request)
Perform an unary online prediction request to a gRPC model server for
Vertex first-party products and frameworks.
public ListenableFuture<DirectRawPredictResponse> directRawPredict(DirectRawPredictRequest request)
Perform an unary online prediction request to a gRPC model server for
custom containers.
public ListenableFuture<ExplainResponse> explain(ExplainRequest request)
Perform an online explanation.
If
deployed_model_id
is specified, the corresponding DeployModel must have
explanation_spec
populated. If
deployed_model_id
is not specified, all DeployedModels must have
explanation_spec
populated.
Returns |
Type |
Description |
com.google.common.util.concurrent.ListenableFuture<ExplainResponse> |
|
generateContent(GenerateContentRequest request)
public ListenableFuture<GenerateContentResponse> generateContent(GenerateContentRequest request)
Generate content with multimodal inputs.
public ListenableFuture<PredictResponse> predict(PredictRequest request)
Perform an online prediction.
Returns |
Type |
Description |
com.google.common.util.concurrent.ListenableFuture<PredictResponse> |
|
public ListenableFuture<HttpBody> rawPredict(RawPredictRequest request)
Perform an online prediction with an arbitrary HTTP payload.
The response includes the following HTTP headers:
X-Vertex-AI-Endpoint-Id
: ID of the
Endpoint that served this
prediction.
X-Vertex-AI-Deployed-Model-Id
: ID of the Endpoint's
DeployedModel that served
this prediction.
Returns |
Type |
Description |
com.google.common.util.concurrent.ListenableFuture<com.google.api.HttpBody> |
|