Specification of a single machine.
machineType
string
Immutable. The type of the machine.
See the list of machine types supported for prediction
See the list of machine types supported for custom training.
For DeployedModel
this field is optional, and the default value is n1-standard-2
. For BatchPredictionJob
or as part of WorkerPoolSpec
this field is required.
Immutable. The type of accelerator(s) that may be attached to the machine as per acceleratorCount
.
acceleratorCount
integer
The number of accelerators to attach to the machine.
tpuTopology
string
Immutable. The topology of the TPUs. Corresponds to the TPU topologies available from GKE. (Example: tpuTopology: "2x2x1").
Optional. Immutable. Configuration controlling how this resource pool consumes reservation.
JSON representation |
---|
{ "machineType": string, "acceleratorType": enum ( |
AcceleratorType
Represents a hardware accelerator type.
Enums | |
---|---|
ACCELERATOR_TYPE_UNSPECIFIED |
Unspecified accelerator type, which means no accelerator. |
NVIDIA_TESLA_K80 |
Deprecated: Nvidia Tesla K80 GPU has reached end of support, see https://cloud.google.com/compute/docs/eol/k80-eol. |
NVIDIA_TESLA_P100 |
Nvidia Tesla P100 GPU. |
NVIDIA_TESLA_V100 |
Nvidia Tesla V100 GPU. |
NVIDIA_TESLA_P4 |
Nvidia Tesla P4 GPU. |
NVIDIA_TESLA_T4 |
Nvidia Tesla T4 GPU. |
NVIDIA_TESLA_A100 |
Nvidia Tesla A100 GPU. |
NVIDIA_A100_80GB |
Nvidia A100 80GB GPU. |
NVIDIA_L4 |
Nvidia L4 GPU. |
NVIDIA_H100_80GB |
Nvidia H100 80Gb GPU. |
NVIDIA_H100_MEGA_80GB |
Nvidia H100 Mega 80Gb GPU. |
TPU_V2 |
TPU v2. |
TPU_V3 |
TPU v3. |
TPU_V4_POD |
TPU v4. |
TPU_V5_LITEPOD |
TPU v5. |
ReservationAffinity
A ReservationAffinity can be used to configure a Vertex AI resource (e.g., a DeployedModel) to draw its Compute Engine resources from a Shared Reservation, or exclusively from on-demand capacity.
Required. Specifies the reservation affinity type.
key
string
Optional. Corresponds to the label key of a reservation resource. To target a SPECIFIC_RESERVATION by name, use compute.googleapis.com/reservation-name
as the key and specify the name of your reservation as its value.
values[]
string
Optional. Corresponds to the label values of a reservation resource. This must be the full resource name of the reservation.
JSON representation |
---|
{
"reservationAffinityType": enum ( |
Type
Identifies a type of reservation affinity.
Enums | |
---|---|
TYPE_UNSPECIFIED |
Default value. This should not be used. |
NO_RESERVATION |
Do not consume from any reserved capacity, only use on-demand. |
ANY_RESERVATION |
Consume any reservation available, falling back to on-demand. |
SPECIFIC_RESERVATION |
Consume from a specific reservation. When chosen, the reservation must be identified via the key and values fields. |