FetchBenchmarkingDataRequest(
mapping=None, *, ignore_unknown_fields=False, **kwargs
)Request message for GkeInferenceQuickstart.FetchBenchmarkingData.
Attributes |
|
|---|---|
| Name | Description |
model_server_info |
google.cloud.gkerecommender_v1.types.ModelServerInfo
Required. The model server configuration to get benchmarking data for. Use GkeInferenceQuickstart.FetchProfiles to find valid configurations. |
instance_type |
str
Optional. The instance type to filter benchmarking data. Instance types are in the format a2-highgpu-1g. If not
provided, all instance types for the given profile's
model_server_info will be returned. Use
GkeInferenceQuickstart.FetchProfiles
to find available instance types.
|
pricing_model |
str
Optional. The pricing model to use for the benchmarking data. Defaults to spot.
|