A service for managing Vertex AI's Endpoints.
Equality
Instances of this class created via copy-construction or copy-assignment always compare equal. Instances created with equal std::shared_ptr<*Connection>
objects compare equal. Objects that compare equal share the same underlying resources.
Performance
Creating a new instance of this class is a relatively expensive operation, new objects establish new connections to the service. In contrast, copy-construction, move-construction, and the corresponding assignment operations are relatively efficient as the copies share all underlying resources.
Thread Safety
Concurrent access to different instances of this class, even if they compare equal, is guaranteed to work. Two or more threads operating on the same instance of this class is not guaranteed to work. Since copy-construction and move-construction is a relatively efficient operation, consider using such a copy when using this class from multiple threads.
Constructors
EndpointServiceClient(EndpointServiceClient const &)
Copy and move support
Parameter | |
---|---|
Name | Description |
|
EndpointServiceClient const &
|
EndpointServiceClient(EndpointServiceClient &&)
Copy and move support
Parameter | |
---|---|
Name | Description |
|
EndpointServiceClient &&
|
EndpointServiceClient(std::shared_ptr< EndpointServiceConnection >, Options)
Parameters | |
---|---|
Name | Description |
connection |
std::shared_ptr< EndpointServiceConnection >
|
opts |
Options
|
Operators
operator=(EndpointServiceClient const &)
Copy and move support
Parameter | |
---|---|
Name | Description |
|
EndpointServiceClient const &
|
Returns | |
---|---|
Type | Description |
EndpointServiceClient & |
operator=(EndpointServiceClient &&)
Copy and move support
Parameter | |
---|---|
Name | Description |
|
EndpointServiceClient &&
|
Returns | |
---|---|
Type | Description |
EndpointServiceClient & |
Functions
CreateEndpoint(std::string const &, google::cloud::aiplatform::v1::Endpoint const &, Options)
Creates an Endpoint.
Parameters | |
---|---|
Name | Description |
parent |
std::string const &
Required. The resource name of the Location to create the Endpoint in. Format: |
endpoint |
google::cloud::aiplatform::v1::Endpoint const &
Required. The Endpoint to create. |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::Endpoint > > |
A |
CreateEndpoint(NoAwaitTag, std::string const &, google::cloud::aiplatform::v1::Endpoint const &, Options)
Creates an Endpoint.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
parent |
std::string const &
|
endpoint |
google::cloud::aiplatform::v1::Endpoint const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
CreateEndpoint(std::string const &, google::cloud::aiplatform::v1::Endpoint const &, std::string const &, Options)
Creates an Endpoint.
Parameters | |
---|---|
Name | Description |
parent |
std::string const &
Required. The resource name of the Location to create the Endpoint in. Format: |
endpoint |
google::cloud::aiplatform::v1::Endpoint const &
Required. The Endpoint to create. |
endpoint_id |
std::string const &
Immutable. The ID to use for endpoint, which will become the final component of the endpoint resource name. If not provided, Vertex AI will generate a value for this ID. |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::Endpoint > > |
A |
CreateEndpoint(NoAwaitTag, std::string const &, google::cloud::aiplatform::v1::Endpoint const &, std::string const &, Options)
Creates an Endpoint.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
parent |
std::string const &
|
endpoint |
google::cloud::aiplatform::v1::Endpoint const &
|
endpoint_id |
std::string const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
CreateEndpoint(google::cloud::aiplatform::v1::CreateEndpointRequest const &, Options)
Creates an Endpoint.
Parameters | |
---|---|
Name | Description |
request |
google::cloud::aiplatform::v1::CreateEndpointRequest const &
Unary RPCs, such as the one wrapped by this function, receive a single |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::Endpoint > > |
A |
CreateEndpoint(NoAwaitTag, google::cloud::aiplatform::v1::CreateEndpointRequest const &, Options)
Creates an Endpoint.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
request |
google::cloud::aiplatform::v1::CreateEndpointRequest const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
CreateEndpoint(google::longrunning::Operation const &, Options)
Creates an Endpoint.
This method accepts a google::longrunning::Operation
that corresponds to a previously started Long Running Operation (LRO) and polls the status of the LRO in the background.
Parameters | |
---|---|
Name | Description |
operation |
google::longrunning::Operation const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::Endpoint > > |
GetEndpoint(std::string const &, Options)
Gets an Endpoint.
Parameters | |
---|---|
Name | Description |
name |
std::string const &
Required. The name of the Endpoint resource. Format: |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
StatusOr< google::cloud::aiplatform::v1::Endpoint > |
the result of the RPC. The response message type (google.cloud.aiplatform.v1.Endpoint) is mapped to a C++ class using the Protobuf mapping rules. If the request fails, the |
GetEndpoint(google::cloud::aiplatform::v1::GetEndpointRequest const &, Options)
Gets an Endpoint.
Parameters | |
---|---|
Name | Description |
request |
google::cloud::aiplatform::v1::GetEndpointRequest const &
Unary RPCs, such as the one wrapped by this function, receive a single |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
StatusOr< google::cloud::aiplatform::v1::Endpoint > |
the result of the RPC. The response message type (google.cloud.aiplatform.v1.Endpoint) is mapped to a C++ class using the Protobuf mapping rules. If the request fails, the |
ListEndpoints(std::string const &, Options)
Lists Endpoints in a Location.
Parameters | |
---|---|
Name | Description |
parent |
std::string const &
Required. The resource name of the Location from which to list the Endpoints. Format: |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
StreamRange< google::cloud::aiplatform::v1::Endpoint > |
a StreamRange to iterate of the results. See the documentation of this type for details. In brief, this class has |
ListEndpoints(google::cloud::aiplatform::v1::ListEndpointsRequest, Options)
Lists Endpoints in a Location.
Parameters | |
---|---|
Name | Description |
request |
google::cloud::aiplatform::v1::ListEndpointsRequest
Unary RPCs, such as the one wrapped by this function, receive a single |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
StreamRange< google::cloud::aiplatform::v1::Endpoint > |
a StreamRange to iterate of the results. See the documentation of this type for details. In brief, this class has |
UpdateEndpoint(google::cloud::aiplatform::v1::Endpoint const &, google::protobuf::FieldMask const &, Options)
Updates an Endpoint.
Parameters | |
---|---|
Name | Description |
endpoint |
google::cloud::aiplatform::v1::Endpoint const &
Required. The Endpoint which replaces the resource on the server. |
update_mask |
google::protobuf::FieldMask const &
Required. The update mask applies to the resource. See google.protobuf.FieldMask. |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
StatusOr< google::cloud::aiplatform::v1::Endpoint > |
the result of the RPC. The response message type (google.cloud.aiplatform.v1.Endpoint) is mapped to a C++ class using the Protobuf mapping rules. If the request fails, the |
UpdateEndpoint(google::cloud::aiplatform::v1::UpdateEndpointRequest const &, Options)
Updates an Endpoint.
Parameters | |
---|---|
Name | Description |
request |
google::cloud::aiplatform::v1::UpdateEndpointRequest const &
Unary RPCs, such as the one wrapped by this function, receive a single |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
StatusOr< google::cloud::aiplatform::v1::Endpoint > |
the result of the RPC. The response message type (google.cloud.aiplatform.v1.Endpoint) is mapped to a C++ class using the Protobuf mapping rules. If the request fails, the |
DeleteEndpoint(std::string const &, Options)
Deletes an Endpoint.
Parameters | |
---|---|
Name | Description |
name |
std::string const &
Required. The name of the Endpoint resource to be deleted. Format: |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::DeleteOperationMetadata > > |
A |
DeleteEndpoint(NoAwaitTag, std::string const &, Options)
Deletes an Endpoint.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
name |
std::string const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
DeleteEndpoint(google::cloud::aiplatform::v1::DeleteEndpointRequest const &, Options)
Deletes an Endpoint.
Parameters | |
---|---|
Name | Description |
request |
google::cloud::aiplatform::v1::DeleteEndpointRequest const &
Unary RPCs, such as the one wrapped by this function, receive a single |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::DeleteOperationMetadata > > |
A |
DeleteEndpoint(NoAwaitTag, google::cloud::aiplatform::v1::DeleteEndpointRequest const &, Options)
Deletes an Endpoint.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
request |
google::cloud::aiplatform::v1::DeleteEndpointRequest const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
DeleteEndpoint(google::longrunning::Operation const &, Options)
Deletes an Endpoint.
This method accepts a google::longrunning::Operation
that corresponds to a previously started Long Running Operation (LRO) and polls the status of the LRO in the background.
Parameters | |
---|---|
Name | Description |
operation |
google::longrunning::Operation const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::DeleteOperationMetadata > > |
DeployModel(std::string const &, google::cloud::aiplatform::v1::DeployedModel const &, std::map< std::string, std::int32_t > const &, Options)
Deploys a Model into this Endpoint, creating a DeployedModel within it.
Parameters | |
---|---|
Name | Description |
endpoint |
std::string const &
Required. The name of the Endpoint resource into which to deploy a Model. Format: |
deployed_model |
google::cloud::aiplatform::v1::DeployedModel const &
Required. The DeployedModel to be created within the Endpoint. Note that Endpoint.traffic_split must be updated for the DeployedModel to start receiving traffic, either as part of this call, or via EndpointService.UpdateEndpoint. |
traffic_split |
std::map< std::string, std::int32_t > const &
A map from a DeployedModel's ID to the percentage of this Endpoint's traffic that should be forwarded to that DeployedModel. |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::DeployModelResponse > > |
A |
DeployModel(NoAwaitTag, std::string const &, google::cloud::aiplatform::v1::DeployedModel const &, std::map< std::string, std::int32_t > const &, Options)
Deploys a Model into this Endpoint, creating a DeployedModel within it.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
endpoint |
std::string const &
|
deployed_model |
google::cloud::aiplatform::v1::DeployedModel const &
|
traffic_split |
std::map< std::string, std::int32_t > const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
DeployModel(google::cloud::aiplatform::v1::DeployModelRequest const &, Options)
Deploys a Model into this Endpoint, creating a DeployedModel within it.
Parameters | |
---|---|
Name | Description |
request |
google::cloud::aiplatform::v1::DeployModelRequest const &
Unary RPCs, such as the one wrapped by this function, receive a single |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::DeployModelResponse > > |
A |
DeployModel(NoAwaitTag, google::cloud::aiplatform::v1::DeployModelRequest const &, Options)
Deploys a Model into this Endpoint, creating a DeployedModel within it.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
request |
google::cloud::aiplatform::v1::DeployModelRequest const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
DeployModel(google::longrunning::Operation const &, Options)
Deploys a Model into this Endpoint, creating a DeployedModel within it.
This method accepts a google::longrunning::Operation
that corresponds to a previously started Long Running Operation (LRO) and polls the status of the LRO in the background.
Parameters | |
---|---|
Name | Description |
operation |
google::longrunning::Operation const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::DeployModelResponse > > |
UndeployModel(std::string const &, std::string const &, std::map< std::string, std::int32_t > const &, Options)
Undeploys a Model from an Endpoint, removing a DeployedModel from it, and freeing all resources it's using.
Parameters | |
---|---|
Name | Description |
endpoint |
std::string const &
Required. The name of the Endpoint resource from which to undeploy a Model. Format: |
deployed_model_id |
std::string const &
Required. The ID of the DeployedModel to be undeployed from the Endpoint. |
traffic_split |
std::map< std::string, std::int32_t > const &
If this field is provided, then the Endpoint's traffic_split will be overwritten with it. If last DeployedModel is being undeployed from the Endpoint, the [Endpoint.traffic_split] will always end up empty when this call returns. A DeployedModel will be successfully undeployed only if it doesn't have any traffic assigned to it when this method executes, or if this field unassigns any traffic to it. |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::UndeployModelResponse > > |
A |
UndeployModel(NoAwaitTag, std::string const &, std::string const &, std::map< std::string, std::int32_t > const &, Options)
Undeploys a Model from an Endpoint, removing a DeployedModel from it, and freeing all resources it's using.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
endpoint |
std::string const &
|
deployed_model_id |
std::string const &
|
traffic_split |
std::map< std::string, std::int32_t > const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
UndeployModel(google::cloud::aiplatform::v1::UndeployModelRequest const &, Options)
Undeploys a Model from an Endpoint, removing a DeployedModel from it, and freeing all resources it's using.
Parameters | |
---|---|
Name | Description |
request |
google::cloud::aiplatform::v1::UndeployModelRequest const &
Unary RPCs, such as the one wrapped by this function, receive a single |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::UndeployModelResponse > > |
A |
UndeployModel(NoAwaitTag, google::cloud::aiplatform::v1::UndeployModelRequest const &, Options)
Undeploys a Model from an Endpoint, removing a DeployedModel from it, and freeing all resources it's using.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
request |
google::cloud::aiplatform::v1::UndeployModelRequest const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
UndeployModel(google::longrunning::Operation const &, Options)
Undeploys a Model from an Endpoint, removing a DeployedModel from it, and freeing all resources it's using.
This method accepts a google::longrunning::Operation
that corresponds to a previously started Long Running Operation (LRO) and polls the status of the LRO in the background.
Parameters | |
---|---|
Name | Description |
operation |
google::longrunning::Operation const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::UndeployModelResponse > > |
MutateDeployedModel(std::string const &, google::cloud::aiplatform::v1::DeployedModel const &, google::protobuf::FieldMask const &, Options)
Updates an existing deployed model.
Updatable fields include min_replica_count
, max_replica_count
, autoscaling_metric_specs
, disable_container_logging
(v1 only), and enable_container_logging
(v1beta1 only).
Parameters | |
---|---|
Name | Description |
endpoint |
std::string const &
Required. The name of the Endpoint resource into which to mutate a DeployedModel. Format: |
deployed_model |
google::cloud::aiplatform::v1::DeployedModel const &
Required. The DeployedModel to be mutated within the Endpoint. Only the following fields can be mutated:
|
update_mask |
google::protobuf::FieldMask const &
Required. The update mask applies to the resource. See google.protobuf.FieldMask. |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::MutateDeployedModelResponse > > |
A |
MutateDeployedModel(NoAwaitTag, std::string const &, google::cloud::aiplatform::v1::DeployedModel const &, google::protobuf::FieldMask const &, Options)
Updates an existing deployed model.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
endpoint |
std::string const &
|
deployed_model |
google::cloud::aiplatform::v1::DeployedModel const &
|
update_mask |
google::protobuf::FieldMask const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
MutateDeployedModel(google::cloud::aiplatform::v1::MutateDeployedModelRequest const &, Options)
Updates an existing deployed model.
Updatable fields include min_replica_count
, max_replica_count
, autoscaling_metric_specs
, disable_container_logging
(v1 only), and enable_container_logging
(v1beta1 only).
Parameters | |
---|---|
Name | Description |
request |
google::cloud::aiplatform::v1::MutateDeployedModelRequest const &
Unary RPCs, such as the one wrapped by this function, receive a single |
opts |
Options
Optional. Override the class-level options, such as retry and backoff policies. |
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::MutateDeployedModelResponse > > |
A |
MutateDeployedModel(NoAwaitTag, google::cloud::aiplatform::v1::MutateDeployedModelRequest const &, Options)
Updates an existing deployed model.
Specifying the NoAwaitTag
immediately returns the [google::longrunning::Operation
] that corresponds to the Long Running Operation that has been started. No polling for operation status occurs.
Parameters | |
---|---|
Name | Description |
|
NoAwaitTag
|
request |
google::cloud::aiplatform::v1::MutateDeployedModelRequest const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
StatusOr< google::longrunning::Operation > |
MutateDeployedModel(google::longrunning::Operation const &, Options)
Updates an existing deployed model.
This method accepts a google::longrunning::Operation
that corresponds to a previously started Long Running Operation (LRO) and polls the status of the LRO in the background.
Parameters | |
---|---|
Name | Description |
operation |
google::longrunning::Operation const &
|
opts |
Options
|
Returns | |
---|---|
Type | Description |
future< StatusOr< google::cloud::aiplatform::v1::MutateDeployedModelResponse > > |