Class AutoscalingMetricSpec (1.36.4)

AutoscalingMetricSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)

The metric specification that defines the target resource utilization (CPU utilization, accelerator's duty cycle, and so on) for calculating the desired replica count.

Attributes

NameDescription
metric_name str
Required. The resource metric name. Supported metrics: - For Online Prediction: - aiplatform.googleapis.com/prediction/online/accelerator/duty_cycle - aiplatform.googleapis.com/prediction/online/cpu/utilization
target int
The target resource utilization in percentage (1% - 100%) for the given metric; once the real usage deviates from the target by a certain percentage, the machine replicas change. The default value is 60 (representing 60%) if not provided.

Methods

AutoscalingMetricSpec

AutoscalingMetricSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)

The metric specification that defines the target resource utilization (CPU utilization, accelerator's duty cycle, and so on) for calculating the desired replica count.