Class FetchModelServerVersionsRequest (0.1.0)

FetchModelServerVersionsRequest(
    mapping=None, *, ignore_unknown_fields=False, **kwargs
)

Request message for GkeInferenceQuickstart.FetchModelServerVersions.

.. _oneof: https://proto-plus-python.readthedocs.io/en/stable/fields.html#oneofs-mutually-exclusive-fields

Attributes

Name Description
model str
Required. The model for which to list model server versions. Open-source models follow the Huggingface Hub owner/model_name format. Use GkeInferenceQuickstart.FetchModels to find available models.
model_server str
Required. The model server for which to list versions. Open-source model servers use simplified, lowercase names (e.g., vllm). Use GkeInferenceQuickstart.FetchModelServers to find available model servers.
page_size int
Optional. The target number of results to return in a single response. If not specified, a default value will be chosen by the service. Note that the response may include a partial list and a caller should only rely on the response's next_page_token to determine if there are more instances left to be queried. This field is a member of oneof_ _page_size.
page_token str
Optional. The value of next_page_token received from a previous FetchModelServerVersionsRequest call. Provide this to retrieve the subsequent page in a multi-page list of results. When paginating, all other parameters provided to FetchModelServerVersionsRequest must match the call that provided the page token. This field is a member of oneof_ _page_token.