public static final class PredictionServiceGrpc.PredictionServiceFutureStub extends AbstractFutureStub<PredictionServiceGrpc.PredictionServiceFutureStub>
A stub to allow clients to do ListenableFuture-style rpc calls to service PredictionService.
A service for online predictions and explanations.
Inheritance
java.lang.Object > io.grpc.stub.AbstractStub > io.grpc.stub.AbstractFutureStub > PredictionServiceGrpc.PredictionServiceFutureStubMethods
build(Channel channel, CallOptions callOptions)
protected PredictionServiceGrpc.PredictionServiceFutureStub build(Channel channel, CallOptions callOptions)
Parameters | |
---|---|
Name | Description |
channel |
io.grpc.Channel |
callOptions |
io.grpc.CallOptions |
Returns | |
---|---|
Type | Description |
PredictionServiceGrpc.PredictionServiceFutureStub |
countTokens(CountTokensRequest request)
public ListenableFuture<CountTokensResponse> countTokens(CountTokensRequest request)
Perform a token counting.
Parameter | |
---|---|
Name | Description |
request |
CountTokensRequest |
Returns | |
---|---|
Type | Description |
com.google.common.util.concurrent.ListenableFuture<CountTokensResponse> |
directPredict(DirectPredictRequest request)
public ListenableFuture<DirectPredictResponse> directPredict(DirectPredictRequest request)
Perform an unary online prediction request for Vertex first-party products and frameworks.
Parameter | |
---|---|
Name | Description |
request |
DirectPredictRequest |
Returns | |
---|---|
Type | Description |
com.google.common.util.concurrent.ListenableFuture<DirectPredictResponse> |
directRawPredict(DirectRawPredictRequest request)
public ListenableFuture<DirectRawPredictResponse> directRawPredict(DirectRawPredictRequest request)
Perform an online prediction request through gRPC.
Parameter | |
---|---|
Name | Description |
request |
DirectRawPredictRequest |
Returns | |
---|---|
Type | Description |
com.google.common.util.concurrent.ListenableFuture<DirectRawPredictResponse> |
explain(ExplainRequest request)
public ListenableFuture<ExplainResponse> explain(ExplainRequest request)
Perform an online explanation. If deployed_model_id is specified, the corresponding DeployModel must have explanation_spec populated. If deployed_model_id is not specified, all DeployedModels must have explanation_spec populated.
Parameter | |
---|---|
Name | Description |
request |
ExplainRequest |
Returns | |
---|---|
Type | Description |
com.google.common.util.concurrent.ListenableFuture<ExplainResponse> |
predict(PredictRequest request)
public ListenableFuture<PredictResponse> predict(PredictRequest request)
Perform an online prediction.
Parameter | |
---|---|
Name | Description |
request |
PredictRequest |
Returns | |
---|---|
Type | Description |
com.google.common.util.concurrent.ListenableFuture<PredictResponse> |
rawPredict(RawPredictRequest request)
public ListenableFuture<HttpBody> rawPredict(RawPredictRequest request)
Perform an online prediction with an arbitrary HTTP payload. The response includes the following HTTP headers:
X-Vertex-AI-Endpoint-Id
: ID of the Endpoint that served this prediction.X-Vertex-AI-Deployed-Model-Id
: ID of the Endpoint's DeployedModel that served this prediction.
Parameter | |
---|---|
Name | Description |
request |
RawPredictRequest |
Returns | |
---|---|
Type | Description |
com.google.common.util.concurrent.ListenableFuture<com.google.api.HttpBody> |