After your AutoML text classification model is done training, use the Vertex AI console to create an endpoint and deploy your model to the endpoint. After your model is deployed to the endpoint, send a document to the model for label prediction.
This tutorial has several pages:
Deploy model to an endpoint and send a prediction.
Each page assumes that you have already performed the instructions from the previous pages of the tutorial.
Deploy your model to an endpoint
Access your trained model to deploy it to a new endpoint from the Model Registry page.
In the Google Cloud console, go to the Model Registry page.
For Region, select us-central1 (Iowa).
Click the name and version number of your trained AutoML model to view details about your model.
For example, in the Evaluate tab, you can view your model's performance metrics.
Select the Deploy & test tab to create an endpoint.
Click Deploy to endpoint.
In the Deploy to endpoint window, complete the following steps:
Choose
Create new endpoint and enter a name for the endpoint such ashello_automl_text
.Accept the Traffic split of 100%, and click Deploy.
It takes several minutes to create the endpoint and deploy the AutoML model to the new endpoint.
Send a prediction to your model
After the endpoint is created, you can send text predictions from the Vertex AI console.
In the Google Cloud console, go to the Model Registry page.
For Region, select us-central1 (Iowa).
Click your trained AutoML model.
Select the Deploy & test tab
In the Test your model section, enter text for prediction.
Click Predict to view the model's predicted label and confidence score.
What's next
- To clean up resources that you have created, follow the last page of the tutorial