Class AutoscalingMetricSpec (1.18.0)

Stay organized with collections Save and categorize content based on your preferences.
AutoscalingMetricSpec(mapping=None, *, ignore_unknown_fields=False, **kwargs)

The metric specification that defines the target resource utilization (CPU utilization, accelerator's duty cycle, and so on) for calculating the desired replica count.

Attributes

NameDescription
metric_name str
Required. The resource metric name. Supported metrics: - For Online Prediction: - ``aiplatform.googleapis.com/prediction/online/accelerator/duty_cycle`` - ``aiplatform.googleapis.com/prediction/online/cpu/utilization``
target int
The target resource utilization in percentage (1% - 100%) for the given metric; once the real usage deviates from the target by a certain percentage, the machine replicas change. The default value is 60 (representing 60%) if not provided.

Inheritance

builtins.object > proto.message.Message > AutoscalingMetricSpec