- NAME
-
- gcloud alpha ai endpoints predict - run Vertex AI online prediction
- SYNOPSIS
-
-
gcloud alpha ai endpoints predict
(ENDPOINT
:--region
=REGION
)--json-request
=JSON_REQUEST
[GCLOUD_WIDE_FLAG …
]
-
- DESCRIPTION
-
(ALPHA)
gcloud alpha ai endpoints predict
sends a prediction request to Vertex AI endpoint for the given instances. This command will read up to 100 instances, though the service itself will accept instances up to the payload limit size (currently, 1.5MB). - EXAMPLES
-
To predict against an endpoint
under project123
in regionexample
, run:us-central1
gcloud alpha ai endpoints predict 123 --project=example --region=us-central1 --json-request=input.json
- POSITIONAL ARGUMENTS
-
-
Endpoint resource - The endpoint to do online prediction. The arguments in this
group can be used to specify the attributes of this resource. (NOTE) Some
attributes are not given arguments in this group but can be set in other ways.
To set the
project
attribute:-
provide the argument
endpoint
on the command line with a fully specified name; -
provide the argument
--project
on the command line; -
set the property
core/project
.
This must be specified.
ENDPOINT
-
ID of the endpoint or fully qualified identifier for the endpoint.
To set the
name
attribute:-
provide the argument
endpoint
on the command line.
This positional argument must be specified if any of the other arguments in this group are specified.
-
provide the argument
--region
=REGION
-
Cloud region for the endpoint.
To set the
region
attribute:-
provide the argument
endpoint
on the command line with a fully specified name; -
provide the argument
--region
on the command line; -
set the property
ai/region
; - choose one from the prompted list of available regions.
-
provide the argument
-
provide the argument
-
Endpoint resource - The endpoint to do online prediction. The arguments in this
group can be used to specify the attributes of this resource. (NOTE) Some
attributes are not given arguments in this group but can be set in other ways.
- REQUIRED FLAGS
-
--json-request
=JSON_REQUEST
-
Path to a local file containing the body of a JSON request.
An example of a JSON request:
{ "instances": [ {"x": [1, 2], "y": [3, 4]}, {"x": [-1, -2], "y": [-3, -4]} ] }
This flag accepts "-" for stdin.
- GCLOUD WIDE FLAGS
-
These flags are available to all commands:
--access-token-file
,--account
,--billing-project
,--configuration
,--flags-file
,--flatten
,--format
,--help
,--impersonate-service-account
,--log-http
,--project
,--quiet
,--trace-token
,--user-output-enabled
,--verbosity
.Run
$ gcloud help
for details. - NOTES
-
This command is currently in alpha and might change without notice. If this
command fails with API permission errors despite specifying the correct project,
you might be trying to access an API with an invitation-only early access
allowlist. These variants are also available:
gcloud ai endpoints predict
gcloud beta ai endpoints predict
Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2024-02-06 UTC.