- Resource: Version
- Methods
Resource: Version
Represents a version of the model.
Each version is a trained model deployed in the cloud, ready to handle prediction requests. A model can have multiple versions. You can get information about all of the versions of a given model by calling projects.models.versions.list
.
JSON representation | |
---|---|
{ "name": string, "description": string, "isDefault": boolean, "deploymentUri": string, "createTime": string, "lastUseTime": string, "runtimeVersion": string, "machineType": string, "state": enum ( |
Fields | ||
---|---|---|
name |
Required. The name specified for the version when it was created. The version name must be unique within the model it is created in. |
|
description |
Optional. The description specified for the version when it was created. |
|
isDefault |
Output only. If true, this version will be used to handle prediction requests that do not specify a version. You can change the default version by calling |
|
deploymentUri |
The Cloud Storage URI of a directory containing trained model artifacts to be used to create the model version. See the guide to deploying models for more information. The total number of files under this directory must not exceed 1000. During If you specify |
|
createTime |
Output only. The time the version was created. |
|
lastUseTime |
Output only. The time the version was last used for prediction. |
|
runtimeVersion |
Required. The AI Platform runtime version to use for this deployment. For more information, see the runtime version list and how to manage runtime versions. |
|
machineType |
Optional. The type of machine on which to serve the model. Currently only applies to online prediction service. To learn about valid values for this field, read Choosing a machine type for online prediction. If this field is not specified and you are using a regional endpoint, then the machine type defaults to If this field is not specified and you are using the global endpoint ( |
|
state |
Output only. The state of a version. |
|
errorMessage |
Output only. The details of a failure or a cancellation. |
|
packageUris[] |
Optional. Cloud Storage paths ( For a custom prediction routine, one of these packages must contain your Predictor class (see If you specify this field, you must also set |
|
labels |
Optional. One or more labels that you can add, to organize your model versions. Each label is a key-value pair, where both the key and the value are arbitrary strings that you supply. For more information, see the documentation on using labels. An object containing a list of |
|
etag |
A base64-encoded string. |
|
framework |
Optional. The machine learning framework AI Platform uses to train this version of the model. Valid values are Do not specify a framework if you're deploying a custom prediction routine or if you're using a custom container. |
|
pythonVersion |
Required. The version of Python used in prediction. The following Python versions are available:
Read more about the Python versions available for each runtime version. |
|
acceleratorConfig |
Optional. Accelerator config for using GPUs for online prediction (beta). Only specify this field if you have specified a Compute Engine (N1) machine type in the |
|
serviceAccount |
Optional. Specifies the service account for resource access control. If you specify this field, then you must also specify either the Learn more about using a custom service account. |
|
requestLoggingConfig |
Optional. Only specify this field in a Configures the request-response pair logging on predictions from this Version. |
|
explanationConfig |
Optional. Configures explainability features on the model's version. Some explanation features require additional metadata to be loaded as part of the model payload. |
|
container |
Optional. Specifies a custom container to use for serving predictions. If you specify this field, then If you specify this field, then If you specify this field, then you must not specify |
|
routes |
Optional. Specifies paths on a custom container's HTTP server where AI Platform Prediction sends certain requests. If you specify this field, then you must also specify the If you specify the
See |
|
lastMigrationTime |
Output only. The last time this version was successfully migrated to AI Platform (Unified). |
|
lastMigrationModelId |
Output only. The AI Platform (Unified) |
|
Union field scaling . Optional. Sets the options for scaling. If not specified, defaults to auto_scaling with min_nodes of 0 (see doc for AutoScaling.min_nodes ) scaling can be only one of the following: |
||
autoScaling |
Automatically scale the number of nodes used to serve the model in response to increases and decreases in traffic. Care should be taken to ramp up traffic according to the model's ability to scale or you will start seeing increases in latency and 429 response codes. |
|
manualScaling |
Manually select the number of nodes to use for serving the model. You should generally use |
|
predictionClass |
Optional. The fully qualified name (MODULE_NAME.CLASS_NAME) of a class that implements the Predictor interface described in this reference field. The module containing this class should be included in a package provided to the Specify this field if and only if you are deploying a custom prediction routine (beta). If you specify this field, you must set The following code sample provides the Predictor interface: class Predictor(object): """Interface for constructing custom predictors.""" def predict(self, instances, **kwargs): """Performs custom prediction. Instances are the decoded values from the request. They have already been deserialized from JSON. Args: instances: A list of prediction input instances. **kwargs: A dictionary of keyword args provided as additional fields on the predict request body. Returns: A list of outputs containing the prediction results. This list must be JSON serializable. """ raise NotImplementedError() @classmethod def from_path(cls, model_dir): """Creates an instance of Predictor using the given path. Loading of the predictor should be done in this method. Args: model_dir: The local directory that contains the exported model file along with any additional files uploaded when creating the version resource. Returns: An instance implementing this Predictor class. """ raise NotImplementedError() Learn more about the Predictor interface and custom prediction routines. |
AutoScaling
Options for automatically scaling a model.
JSON representation | |
---|---|
{
"minNodes": integer,
"maxNodes": integer,
"metrics": [
{
object ( |
Fields | |
---|---|
minNodes |
Optional. The minimum number of nodes to allocate for this model. These nodes are always up, starting from the time the model is deployed. Therefore, the cost of operating this model will be at least Unlike manual scaling, if the load gets too heavy for the nodes that are up, the service will automatically add nodes to handle the increased load as well as scale back as traffic drops, always maintaining at least If If You can set update_body.json: { 'autoScaling': { 'minNodes': 5 } } HTTP request: PATCH https://ml.googleapis.com/v1/{name=projects/*/models/*/versions/*}?updateMask=autoScaling.minNodes -d @./update_body.json |
maxNodes |
The maximum number of nodes to scale this model under load. The actual value will depend on resource quota and availability. |
metrics[] |
MetricSpec contains the specifications to use to calculate the desired nodes count. |
MetricSpec
MetricSpec contains the specifications to use to calculate the desired nodes count when autoscaling is enabled.
JSON representation | |
---|---|
{
"name": enum ( |
Fields | |
---|---|
name |
metric name. |
target |
Target specifies the target value for the given metric; once real metric deviates from the threshold by a certain percentage, the node count changes. |
MetricName
Metric Name.
Enums | |
---|---|
METRIC_NAME_UNSPECIFIED |
Unspecified MetricName. |
CPU_USAGE |
CPU usage. |
GPU_DUTY_CYCLE |
GPU duty cycle. |
ManualScaling
Options for manually scaling a model.
JSON representation | |
---|---|
{ "nodes": integer } |
Fields | |
---|---|
nodes |
The number of nodes to allocate for this model. These nodes are always up, starting from the time the model is deployed, so the cost of operating this model will be proportional to |
State
Describes the version state.
Enums | |
---|---|
UNKNOWN |
The version state is unspecified. |
READY |
The version is ready for prediction. |
CREATING |
The version is being created. New versions.patch and versions.delete requests will fail if a version is in the CREATING state. |
FAILED |
The version failed to be created, possibly cancelled. errorMessage should contain the details of the failure. |
DELETING |
The version is being deleted. New versions.patch and versions.delete requests will fail if a version is in the DELETING state. |
UPDATING |
The version is being updated. New versions.patch and versions.delete requests will fail if a version is in the UPDATING state. |
Framework
Available frameworks for prediction.
Enums | |
---|---|
FRAMEWORK_UNSPECIFIED |
Unspecified framework. Assigns a value based on the file suffix. |
TENSORFLOW |
Tensorflow framework. |
SCIKIT_LEARN |
Scikit-learn framework. |
XGBOOST |
XGBoost framework. |
AcceleratorConfig
Represents a hardware accelerator request config. Note that the AcceleratorConfig can be used in both Jobs and Versions. Learn more about accelerators for training and accelerators for online prediction.
JSON representation | |
---|---|
{
"count": string,
"type": enum ( |
Fields | |
---|---|
count |
The number of accelerators to attach to each machine running the job. |
type |
The type of accelerator to use. |
RequestLoggingConfig
Configuration for logging request-response pairs to a BigQuery table. Online prediction requests to a model version and the responses to these requests are converted to raw strings and saved to the specified BigQuery table. Logging is constrained by BigQuery quotas and limits. If your project exceeds BigQuery quotas or limits, AI Platform Prediction does not log request-response pairs, but it continues to serve predictions.
If you are using continuous evaluation, you do not need to specify this configuration manually. Setting up continuous evaluation automatically enables logging of request-response pairs.
JSON representation | |
---|---|
{ "samplingPercentage": number, "bigqueryTableName": string } |
Fields | ||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
samplingPercentage |
Percentage of requests to be logged, expressed as a fraction from 0 to 1. For example, if you want to log 10% of requests, enter |
|||||||||||||||||||||
bigqueryTableName |
Required. Fully qualified BigQuery table name in the following format: "projectId.DATASET_NAME.TABLE_NAME" The specified table must already exist, and the "Cloud ML Service Agent" for your project must have permission to write to it. The table must have the following schema:
|
ExplanationConfig
Message holding configuration options for explaining model predictions. There are three feature attribution methods supported for TensorFlow models: integrated gradients, sampled Shapley, and XRAI. Learn more about feature attributions.
JSON representation | |
---|---|
{ // Union field |
Fields | ||
---|---|---|
Union field attribution_method . The attribution method to enable for explaining the model's predictions. attribution_method can be only one of the following: |
||
integratedGradientsAttribution |
Attributes credit by computing the Aumann-Shapley value taking advantage of the model's fully differentiable structure. Refer to this paper for more details: https://arxiv.org/abs/1703.01365 |
|
sampledShapleyAttribution |
An attribution method that approximates Shapley values for features that contribute to the label being predicted. A sampling strategy is used to approximate the value rather than considering all subsets of features. |
|
xraiAttribution |
Attributes credit by computing the XRAI taking advantage of the model's fully differentiable structure. Refer to this paper for more details: https://arxiv.org/abs/1906.02825 Currently only implemented for models with natural image inputs. |
IntegratedGradientsAttribution
Attributes credit by computing the Aumann-Shapley value taking advantage of the model's fully differentiable structure. Refer to this paper for more details: https://arxiv.org/abs/1703.01365
JSON representation | |
---|---|
{ "numIntegralSteps": integer } |
Fields | |
---|---|
numIntegralSteps |
Number of steps for approximating the path integral. A good value to start is 50 and gradually increase until the sum to diff property is met within the desired error range. |
SampledShapleyAttribution
An attribution method that approximates Shapley values for features that contribute to the label being predicted. A sampling strategy is used to approximate the value rather than considering all subsets of features.
JSON representation | |
---|---|
{ "numPaths": integer } |
Fields | |
---|---|
numPaths |
The number of feature permutations to consider when approximating the Shapley values. |
XraiAttribution
Attributes credit by computing the XRAI taking advantage of the model's fully differentiable structure. Refer to this paper for more details: https://arxiv.org/abs/1906.02825 Currently only implemented for models with natural image inputs.
JSON representation | |
---|---|
{ "numIntegralSteps": integer } |
Fields | |
---|---|
numIntegralSteps |
Number of steps for approximating the path integral. A good value to start is 50 and gradually increase until the sum to diff property is met within the desired error range. |
ContainerSpec
Specification of a custom container for serving predictions. This message is a subset of the Kubernetes Container v1 core specification.
JSON representation | |
---|---|
{ "image": string, "command": [ string ], "args": [ string ], "ports": [ { object ( |
Fields | |
---|---|
image |
URI of the Docker image to be used as the custom container for serving predictions. This URI must identify an image in Artifact Registry and begin with the hostname To use a custom container, the AI Platform Google-managed service account must have permission to pull (read) the Docker image at this URI. The AI Platform Google-managed service account has the following format:
{PROJECT_NUMBER} is replaced by your Google Cloud project number. By default, this service account has necessary permissions to pull an Artifact Registry image in the same Google Cloud project where you are using AI Platform Prediction. In this case, no configuration is necessary. If you want to use an image from a different Google Cloud project, learn how to grant the Artifact Registry Reader (roles/artifactregistry.reader) role for a repository to your projet's AI Platform Google-managed service account. To learn about the requirements for the Docker image itself, read Custom container requirements. |
command[] |
Immutable. Specifies the command that runs when the container starts. This overrides the container's If you do not specify this field, then the container's If you specify this field, then you can also specify the In this field, you can reference environment variables set by AI Platform Prediction and environment variables set in the
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with
This field corresponds to the |
args[] |
Immutable. Specifies arguments for the command that runs when the container starts. This overrides the container's If you don't specify this field but do specify the If you don't specify this field and don't specify the In this field, you can reference environment variables set by AI Platform Prediction and environment variables set in the
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with
This field corresponds to the |
ports[] |
Immutable. List of ports to expose from the container. AI Platform Prediction sends any prediction requests that it receives to the first port on this list. AI Platform Prediction also sends liveness and health checks to this port. If you do not specify this field, it defaults to following value:
AI Platform Prediction does not use ports other than the first one listed. This field corresponds to the |
env[] |
Immutable. List of environment variables to set in the container. After the container starts running, code running in the container can read these environment variables. Additionally, the
If you switch the order of the variables in the example, then the expansion does not occur. This field corresponds to the |
ContainerPort
Represents a network port in a single container.
This message is a subset of the Kubernetes ContainerPort v1 core specification.
JSON representation | |
---|---|
{ "containerPort": integer } |
Fields | |
---|---|
containerPort |
Number of the port to expose on the container. This must be a valid port number: 0 < PORT_NUMBER < 65536. |
EnvVar
Represents an environment variable to be made available in a container.
This message is a subset of the Kubernetes EnvVar v1 core specification.
JSON representation | |
---|---|
{ "name": string, "value": string } |
Fields | |
---|---|
name |
Name of the environment variable. Must be a valid C identifier and must not begin with the prefix |
value |
Value of the environment variable. Defaults to an empty string. In this field, you can reference environment variables set by AI Platform Prediction and environment variables set earlier in the same
Note that this differs from Bash variable expansion, which does not use parentheses. If a variable cannot be resolved, the reference in the input string is used unchanged. To avoid variable expansion, you can escape this syntax with
|
RouteMap
Specifies HTTP paths served by a custom container. AI Platform Prediction sends requests to these paths on the container; the custom container must run an HTTP server that responds to these requests with appropriate responses. Read Custom container requirements for details on how to create your container image to meet these requirements.
JSON representation | |
---|---|
{ "predict": string, "health": string } |
Fields | |
---|---|
predict |
HTTP path on the container to send prediction requests to. AI Platform Prediction forwards requests sent using For example, if you set this field to If you don't specify this field, it defaults to the following value:
The placeholders in this value are replaced as follows:
|
health |
HTTP path on the container to send health checkss to. AI Platform Prediction intermittently sends GET requests to this path on the container's IP address and port to check that the container is healthy. Read more about health checks. For example, if you set this field to If you don't specify this field, it defaults to the following value:
The placeholders in this value are replaced as follows:
|
Methods |
|
---|---|
|
Creates a new version of a model from a trained TensorFlow model. |
|
Deletes a model version. |
|
Gets information about a model version. |
|
Gets basic information about all the versions of a model. |
|
Updates the specified Version resource. |
|
Designates a version to be the default for the model. |