Schedule a notebook run
This page shows you how to schedule a notebook run in Colab Enterprise.
Overview
You can schedule a notebook to run immediately one time, or on a recurring schedule.
When you schedule the notebook run, you select a runtime template. Colab Enterprise uses this runtime template to create the runtime that runs your notebook.
The runtime needs specific permissions to run the notebook's code and access Google Cloud services and APIs.
If your runtime template configuration has end-user credentials enabled, then the runtime uses the permissions associated with your user credentials.
If end-user credentials aren't enabled, you must specify a service account when you schedule the notebook run. Colab Enterprise uses this service account's credentials to run your notebook.
For more information, see Required roles for running the notebook.
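The choice between the two identities can be sketched as a small helper. This is only an illustration under stated assumptions: `build_identity_fields` is a hypothetical name, not part of any Google library, and the `execution_user` and `service_account` keys are taken from the Python samples later on this page.

```python
from typing import Optional


def build_identity_fields(
    end_user_credentials_enabled: bool,
    user_email: Optional[str] = None,
    service_account: Optional[str] = None,
) -> dict:
    """Return the identity field for a notebook run request (hypothetical helper)."""
    if end_user_credentials_enabled:
        # The runtime uses the permissions tied to the user's credentials.
        if not user_email:
            raise ValueError("A user email is required when end-user credentials are enabled.")
        return {"execution_user": user_email}
    # Without end-user credentials, a service account must be specified.
    if not service_account:
        raise ValueError("A service account is required when end-user credentials aren't enabled.")
    return {"service_account": service_account}
```

The returned dictionary could be merged into the notebook run request so that only one of the two identity fields is ever set.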
After Colab Enterprise completes the notebook run, the results are stored in a shareable Cloud Storage bucket.
Limitations
Colab Enterprise runtimes use Compute Engine quota. See the Compute Engine Allocation quotas page.
Before you begin
- Sign in to your Google Cloud account. If you're new to Google Cloud, create an account to evaluate how our products perform in real-world scenarios. New customers also get $300 in free credits to run, test, and deploy workloads.
-
In the Google Cloud console, on the project selector page, select or create a Google Cloud project.
-
Make sure that billing is enabled for your Google Cloud project.
-
Enable the Vertex AI, Dataform, and Compute Engine APIs.
Required roles for scheduling the notebook run
To ensure that your user account has the necessary permissions to schedule a notebook run in Colab Enterprise, ask your administrator to grant your user account the following IAM roles on the project:
- Colab Enterprise User (roles/aiplatform.colabEnterpriseUser)
- Storage Admin (roles/storage.admin)
For more information about granting roles, see Manage access to projects, folders, and organizations.
Your administrator might also be able to give your user account the required permissions through custom roles or other predefined roles.
Required roles for running the notebook
The principal that runs the notebook needs specific permissions. The principal is either your user account or a service account that you specify, as described in the overview.
To ensure that the principal has the necessary permissions to run a notebook in Colab Enterprise, ask your administrator to grant the principal the following IAM roles:
- Code Viewer (roles/dataform.codeViewer) on the notebook
- Logs Writer (roles/logging.logWriter) on the project
- Monitoring Metric Writer (roles/monitoring.metricWriter) on the project
- Storage Legacy Bucket Writer (roles/storage.legacyBucketWriter) on the notebook
- Storage Legacy Object Reader (roles/storage.legacyObjectReader) on the output bucket
For more information about granting roles, see Manage access to projects, folders, and organizations.
These predefined roles contain the permissions required to run a notebook in Colab Enterprise. To see the exact permissions that are required, expand the Required permissions section:
Required permissions
The following permissions are required to run a notebook in Colab Enterprise:
- dataform.locations.list on the notebook
- dataform.repositories.computeAccessTokenStatus on the notebook
- dataform.repositories.fetchHistory on the notebook
- dataform.repositories.fetchRemoteBranches on the notebook
- dataform.repositories.get on the notebook
- dataform.repositories.getIamPolicy on the notebook
- dataform.repositories.list on the notebook
- dataform.repositories.queryDirectoryContents on the notebook
- dataform.repositories.readFile on the notebook
- logging.logEntries.create on the project
- logging.logEntries.route on the project
- monitoring.metricDescriptors.create on the project
- monitoring.metricDescriptors.get on the project
- monitoring.metricDescriptors.list on the project
- monitoring.monitoredResourceDescriptors.get on the project
- monitoring.monitoredResourceDescriptors.list on the project
- monitoring.timeSeries.create on the project
- resourcemanager.projects.get on the project
- resourcemanager.projects.list on the project
- storage.buckets.get on the notebook
- storage.managedFolders.create on the notebook
- storage.managedFolders.delete on the notebook
- storage.managedFolders.get on the notebook
- storage.managedFolders.list on the notebook
- storage.multipartUploads.abort on the notebook
- storage.multipartUploads.create on the notebook
- storage.multipartUploads.list on the notebook
- storage.multipartUploads.listParts on the notebook
- storage.objects.create on the notebook
- storage.objects.delete on the notebook
- storage.objects.get on the notebook
- storage.objects.list on the notebook
- storage.objects.restore on the notebook
- storage.objects.setRetention on the notebook
Your administrator might also be able to give the principal these permissions with custom roles or other predefined roles.
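As an illustration, an administrator could generate the commands for the two project-level roles listed above (Logs Writer and Monitoring Metric Writer) with a short script. This is a sketch, not an official procedure: `project_role_commands` is a hypothetical helper, the member string is a placeholder, and the roles scoped to the notebook and to the output bucket are granted separately and aren't covered here.

```python
import shlex

# Project-level roles from the list above.
PROJECT_ROLES = ["roles/logging.logWriter", "roles/monitoring.metricWriter"]


def project_role_commands(project_id: str, member: str) -> list[str]:
    """Build one `gcloud projects add-iam-policy-binding` command per role."""
    return [
        shlex.join(
            [
                "gcloud", "projects", "add-iam-policy-binding", project_id,
                f"--member={member}",
                f"--role={role}",
            ]
        )
        for role in PROJECT_ROLES
    ]


if __name__ == "__main__":
    # Placeholder values; use "user:EMAIL" for a user account instead.
    for command in project_role_commands(
        "my-project", "serviceAccount:my-sa@my-project.iam.gserviceaccount.com"
    ):
        print(command)
```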
Run a notebook once
To run a notebook one time, you can use the Google Cloud console, the Google Cloud CLI, or the Vertex AI Python client library.
Console
-
In the Google Cloud console, go to the Colab Enterprise Notebooks page.
-
In the Region menu, select the region that contains your notebook.
-
Next to a notebook, click the Notebook actions menu and select Schedule.
-
In the Schedule name field, enter a name for your schedule.
-
Click the Runtime template list, and select a runtime template. The runtime template determines the specifications of the runtime that runs your notebook.
-
Under Run schedule, select One-off to run your notebook as soon as you submit the notebook run.
-
Next to the Cloud Storage output location field, click Browse to open the Select folder dialog.
-
Select a Cloud Storage bucket. Or, to create a bucket, click Create new bucket and complete the dialog.
-
If you selected a runtime template without end-user credentials enabled, the dialog includes a Service account field. In the Service account field, enter a service account's email address.
-
Click Submit.
The notebook run starts immediately.
gcloud
Before using any of the command data below, make the following replacements:
- DISPLAY_NAME: the display name for your notebook run.
- NOTEBOOK_RUNTIME_TEMPLATE: the notebook runtime template that specifies your runtime's compute configuration.
- NOTEBOOK_URI: the Cloud Storage URI of the notebook to run.
- OUTPUT_URI: the Cloud Storage location where you want to store results.
- USER_EMAIL: the user account email address that specifies the notebook run's access to Google Cloud resources.
- PROJECT_ID: your project ID.
- REGION: the region where your notebook will run.
Execute the following command:
Linux, macOS, or Cloud Shell

    gcloud colab executions create --display-name="DISPLAY_NAME" \
        --notebook-runtime-template=NOTEBOOK_RUNTIME_TEMPLATE \
        --gcs-notebook-uri=NOTEBOOK_URI \
        --gcs-output-uri=OUTPUT_URI \
        --user-email=USER_EMAIL \
        --project=PROJECT_ID \
        --region=REGION

Windows (PowerShell)

    gcloud colab executions create --display-name="DISPLAY_NAME" `
        --notebook-runtime-template=NOTEBOOK_RUNTIME_TEMPLATE `
        --gcs-notebook-uri=NOTEBOOK_URI `
        --gcs-output-uri=OUTPUT_URI `
        --user-email=USER_EMAIL `
        --project=PROJECT_ID `
        --region=REGION

Windows (cmd.exe)

    gcloud colab executions create --display-name="DISPLAY_NAME" ^
        --notebook-runtime-template=NOTEBOOK_RUNTIME_TEMPLATE ^
        --gcs-notebook-uri=NOTEBOOK_URI ^
        --gcs-output-uri=OUTPUT_URI ^
        --user-email=USER_EMAIL ^
        --project=PROJECT_ID ^
        --region=REGION
For more information about managing Colab Enterprise notebook runs from the command line, see the gcloud CLI documentation.
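If you want to start a one-time run from a script, the command above can be wrapped with Python's `subprocess` module. The following is a minimal sketch, assuming the gcloud CLI is installed and authenticated; `build_execution_command` and `run_notebook_once` are hypothetical helper names, and the `gs://` prefix check is an added convenience rather than something the CLI requires you to do yourself.

```python
import subprocess


def build_execution_command(
    display_name: str,
    runtime_template: str,
    notebook_uri: str,
    output_uri: str,
    user_email: str,
    project_id: str,
    region: str,
) -> list[str]:
    """Build the argument list for `gcloud colab executions create`."""
    for uri in (notebook_uri, output_uri):
        if not uri.startswith("gs://"):
            raise ValueError(f"Expected a Cloud Storage URI starting with gs://, got {uri!r}")
    return [
        "gcloud", "colab", "executions", "create",
        f"--display-name={display_name}",
        f"--notebook-runtime-template={runtime_template}",
        f"--gcs-notebook-uri={notebook_uri}",
        f"--gcs-output-uri={output_uri}",
        f"--user-email={user_email}",
        f"--project={project_id}",
        f"--region={region}",
    ]


def run_notebook_once(**kwargs: str) -> None:
    """Run the command and raise CalledProcessError if gcloud exits with an error."""
    subprocess.run(build_execution_command(**kwargs), check=True)
```

Passing the arguments as a list avoids shell quoting problems when a display name contains spaces.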
Python
Before trying this sample, install the Vertex AI SDK for Python. The Vertex AI Python client library is installed when you install the Vertex AI SDK for Python. For more information, see the Vertex AI SDK for Python API reference documentation.
    from google.cloud import aiplatform_v1beta1

    PROJECT_ID = "my-project"
    LOCATION = "us-central1"
    TEMPLATE_ID = "my-template-id"  # Placeholder: your runtime template ID
    REPOSITORY_ID = "my-repository-id"  # Placeholder: your notebook's repository ID
    EMAIL = "user@example.com"  # Placeholder: the user account that runs the notebook
    API_ENDPOINT = f"{LOCATION}-aiplatform.googleapis.com"
    PARENT = f"projects/{PROJECT_ID}/locations/{LOCATION}"

    notebook_service_client = aiplatform_v1beta1.NotebookServiceClient(
        client_options={"api_endpoint": API_ENDPOINT}
    )

    operation = notebook_service_client.create_notebook_execution_job(
        parent=PARENT,
        notebook_execution_job={
            "display_name": "my-execution-job",
            # Specify a NotebookRuntimeTemplate to source compute configuration from
            "notebook_runtime_template_resource_name": f"projects/{PROJECT_ID}/locations/{LOCATION}/notebookRuntimeTemplates/{TEMPLATE_ID}",
            # Specify a Colab Enterprise notebook to run
            "dataform_repository_source": {
                "dataform_repository_resource_name": f"projects/{PROJECT_ID}/locations/{LOCATION}/repositories/{REPOSITORY_ID}",
            },
            # Specify a Cloud Storage bucket to store output artifacts
            "gcs_output_uri": "gs://my-bucket/",
            # Specify the identity that runs the notebook
            "execution_user": EMAIL,
            # Run as the service account instead
            # "service_account": "my-service-account",
        },
    )

    print("Waiting for operation to complete...")
    result = operation.result()
You can view results from completed notebook runs on the Execution jobs tab.
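The resource names in the Python sample follow a fixed pattern, so they can be assembled by small helpers instead of repeated f-strings. This is a sketch only: the function names are hypothetical, and the path patterns are copied from the sample above rather than from a separate specification.

```python
def location_path(project_id: str, location: str) -> str:
    """Parent path shared by runtime templates and repositories."""
    return f"projects/{project_id}/locations/{location}"


def runtime_template_name(project_id: str, location: str, template_id: str) -> str:
    """Full resource name of a notebook runtime template."""
    return f"{location_path(project_id, location)}/notebookRuntimeTemplates/{template_id}"


def repository_name(project_id: str, location: str, repository_id: str) -> str:
    """Full resource name of the repository that backs a notebook."""
    return f"{location_path(project_id, location)}/repositories/{repository_id}"
```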
Schedule a notebook run
To schedule a notebook run, you can use the Google Cloud console, the gcloud CLI, or the Vertex AI Python client library.
Console
-
In the Google Cloud console, go to the Colab Enterprise Notebooks page.
-
In the Region menu, select the region that contains your notebook.
-
Next to a notebook, click the Notebook actions menu and select Schedule.
-
In the Schedule name field, enter a name for your schedule.
-
Click the Runtime template list, and select a runtime template. The runtime template determines the specifications of the runtime that runs your notebook.
-
Under Run schedule, select Recurring to schedule the notebook run for a specific interval of time.
-
Complete the scheduling dialog.
-
Next to the Cloud Storage output location field, click Browse to open the Select folder dialog.
-
Select a Cloud Storage bucket. Or, to create a bucket, click Create new bucket and complete the dialog.
-
If you selected a runtime template without end-user credentials enabled, the dialog includes a Service account field. In the Service account field, enter a service account's email address.
-
Click Submit.
Scheduled notebook runs start automatically on the schedule that you set.
gcloud
Before using any of the command data below, make the following replacements:
- DISPLAY_NAME: the display name of your schedule.
- CRON_SCHEDULE: the schedule that you set, in unix-cron format. For example, 00 19 * * MON means weekly on Monday, at 19:00 Greenwich Mean Time (GMT).
- NOTEBOOK_RUN_NAME: the display name for notebook runs generated by this schedule.
- NOTEBOOK_RUNTIME_TEMPLATE: the notebook runtime template that specifies your runtime's compute configuration.
- NOTEBOOK_URI: the Cloud Storage URI of the notebook to run.
- OUTPUT_URI: the Cloud Storage location where you want to store results.
- USER_EMAIL: the user account email address that specifies the notebook run's access to Google Cloud resources.
- PROJECT_ID: your project ID.
- REGION: the region where your schedule will run.
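To make the unix-cron format concrete, the following sketch splits an expression into its five fields: minute, hour, day of month, month, and day of week. It is an illustration only: `parse_cron` and `CronSpec` are hypothetical names, the optional `TZ=` prefix is taken from the Python sample on this page, and the function checks only the field count, not whether each field's value is valid.

```python
from typing import NamedTuple, Optional


class CronSpec(NamedTuple):
    timezone: Optional[str]
    minute: str
    hour: str
    day_of_month: str
    month: str
    day_of_week: str


def parse_cron(expression: str) -> CronSpec:
    """Split a unix-cron expression, with an optional TZ= prefix, into named fields."""
    parts = expression.split()
    timezone = None
    if parts and parts[0].startswith("TZ="):
        timezone = parts[0][len("TZ="):]
        parts = parts[1:]
    if len(parts) != 5:
        raise ValueError(f"Expected 5 cron fields, got {len(parts)}: {expression!r}")
    return CronSpec(timezone, *parts)


if __name__ == "__main__":
    print(parse_cron("00 19 * * MON"))
```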
Execute the following command:
Linux, macOS, or Cloud Shell

    gcloud colab schedules create --display-name="DISPLAY_NAME" \
        --cron-schedule=CRON_SCHEDULE \
        --execution-display-name=NOTEBOOK_RUN_NAME \
        --notebook-runtime-template=NOTEBOOK_RUNTIME_TEMPLATE \
        --gcs-notebook-uri=NOTEBOOK_URI \
        --gcs-output-uri=OUTPUT_URI \
        --user-email=USER_EMAIL \
        --project=PROJECT_ID \
        --region=REGION

Windows (PowerShell)

    gcloud colab schedules create --display-name="DISPLAY_NAME" `
        --cron-schedule=CRON_SCHEDULE `
        --execution-display-name=NOTEBOOK_RUN_NAME `
        --notebook-runtime-template=NOTEBOOK_RUNTIME_TEMPLATE `
        --gcs-notebook-uri=NOTEBOOK_URI `
        --gcs-output-uri=OUTPUT_URI `
        --user-email=USER_EMAIL `
        --project=PROJECT_ID `
        --region=REGION

Windows (cmd.exe)

    gcloud colab schedules create --display-name="DISPLAY_NAME" ^
        --cron-schedule=CRON_SCHEDULE ^
        --execution-display-name=NOTEBOOK_RUN_NAME ^
        --notebook-runtime-template=NOTEBOOK_RUNTIME_TEMPLATE ^
        --gcs-notebook-uri=NOTEBOOK_URI ^
        --gcs-output-uri=OUTPUT_URI ^
        --user-email=USER_EMAIL ^
        --project=PROJECT_ID ^
        --region=REGION
For more information about creating Colab Enterprise notebook schedules from the command line, see the gcloud CLI documentation.
Python
Before trying this sample, install the Vertex AI SDK for Python. The Vertex AI Python client library is installed when you install the Vertex AI SDK for Python. For more information, see the Vertex AI SDK for Python API reference documentation.
    from google.cloud import aiplatform_v1beta1

    PROJECT_ID = "my-project"
    LOCATION = "us-central1"
    TEMPLATE_ID = "my-template-id"  # Placeholder: your runtime template ID
    REPOSITORY_ID = "my-repository-id"  # Placeholder: your notebook's repository ID
    EMAIL = "user@example.com"  # Placeholder: the user account that runs the notebook
    API_ENDPOINT = f"{LOCATION}-aiplatform.googleapis.com"
    PARENT = f"projects/{PROJECT_ID}/locations/{LOCATION}"

    schedules_service_client = aiplatform_v1beta1.ScheduleServiceClient(
        client_options={"api_endpoint": API_ENDPOINT}
    )

    schedule = schedules_service_client.create_schedule(
        parent=PARENT,
        schedule={
            "display_name": "my-notebook-schedule",
            # Time specification. TZ is optional.
            # cron = "* * * * *" to run it in the next minute.
            "cron": "TZ=America/Los_Angeles * * * * *",
            # How many runs the schedule will trigger before it becomes COMPLETED.
            # A Schedule in COMPLETED state will not trigger any more runs.
            "max_run_count": 1,
            "max_concurrent_run_count": 1,
            "create_notebook_execution_job_request": {
                "parent": PARENT,
                "notebook_execution_job": {
                    "display_name": "my-execution-job",
                    # Specify a NotebookRuntimeTemplate to source compute configuration from
                    "notebook_runtime_template_resource_name": f"projects/{PROJECT_ID}/locations/{LOCATION}/notebookRuntimeTemplates/{TEMPLATE_ID}",
                    # Specify a Colab Enterprise notebook to run
                    "dataform_repository_source": {
                        "dataform_repository_resource_name": f"projects/{PROJECT_ID}/locations/{LOCATION}/repositories/{REPOSITORY_ID}",
                    },
                    # Specify a Cloud Storage bucket to store output artifacts
                    "gcs_output_uri": "gs://my-bucket/",
                    # Specify the identity that runs the notebook
                    "execution_user": EMAIL,
                    # Run as the service account instead
                    # "service_account": "my-service-account",
                },
            },
        },
    )
In the Google Cloud console, you can view your schedules on the Schedules tab. You can view results from the completed notebook runs on the Execution jobs tab.
View results
To view notebook run results, you can use the Google Cloud console, the gcloud CLI, or the Vertex AI Python client library.
Console
-
In the Google Cloud console, go to the Colab Enterprise Execution jobs page.
-
Next to the notebook run that you want to view results for, click View result.
A read-only panel opens that shows the results of the notebook run.
-
To close the panel, click Close.
gcloud
Before using any of the command data below, make the following replacements:
- PROJECT_ID: your project ID.
- REGION: the region where your notebook run results are located.
- SCHEDULE_NAME: the name of the schedule to view results for. To see results from all schedules, omit the --filter flag.
Execute the following command:
Linux, macOS, or Cloud Shell

    gcloud colab executions list --project=PROJECT_ID \
        --region=REGION \
        --filter="scheduleResourceName:SCHEDULE_NAME"

Windows (PowerShell)

    gcloud colab executions list --project=PROJECT_ID `
        --region=REGION `
        --filter="scheduleResourceName:SCHEDULE_NAME"

Windows (cmd.exe)

    gcloud colab executions list --project=PROJECT_ID ^
        --region=REGION ^
        --filter="scheduleResourceName:SCHEDULE_NAME"
For more information about listing Colab Enterprise notebook runs from the command line, see the gcloud CLI documentation.
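To process the list in a script, you can ask gcloud for JSON output with the global `--format=json` flag and parse it in Python. The sketch below assumes the gcloud CLI is installed and authenticated. The helper names are hypothetical, and the `displayName` and `jobState` keys are assumptions about the JSON output, so inspect the real output before relying on them.

```python
import json
import subprocess
from typing import Optional


def build_list_command(project_id: str, region: str, schedule_name: Optional[str] = None) -> list[str]:
    """Build `gcloud colab executions list`, filtering by schedule when a name is given."""
    command = [
        "gcloud", "colab", "executions", "list",
        f"--project={project_id}",
        f"--region={region}",
        "--format=json",
    ]
    if schedule_name:
        command.append(f"--filter=scheduleResourceName:{schedule_name}")
    return command


def summarize_runs(raw_json: str) -> list[tuple[str, str]]:
    """Return (display name, state) pairs; both key names are assumptions."""
    return [(job.get("displayName", ""), job.get("jobState", "")) for job in json.loads(raw_json)]


def list_runs(project_id: str, region: str, schedule_name: Optional[str] = None) -> list[tuple[str, str]]:
    """Run gcloud and summarize the notebook runs that it returns."""
    completed = subprocess.run(
        build_list_command(project_id, region, schedule_name),
        check=True, capture_output=True, text=True,
    )
    return summarize_runs(completed.stdout)
```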
Python
Before trying this sample, install the Vertex AI SDK for Python. The Vertex AI Python client library is installed when you install the Vertex AI SDK for Python. For more information, see the Vertex AI SDK for Python API reference documentation.
    from google.cloud import aiplatform_v1beta1

    PROJECT_ID = "my-project"
    LOCATION = "us-central1"
    API_ENDPOINT = f"{LOCATION}-aiplatform.googleapis.com"
    PARENT = f"projects/{PROJECT_ID}/locations/{LOCATION}"

    notebook_service_client = aiplatform_v1beta1.NotebookServiceClient(
        client_options={"api_endpoint": API_ENDPOINT}
    )

    notebook_execution_jobs = notebook_service_client.list_notebook_execution_jobs(parent=PARENT)

    # Iterate over the returned pager to print each notebook run.
    for notebook_execution_job in notebook_execution_jobs:
        print(notebook_execution_job)
Delete results
To delete a result from one of your notebook runs, you can use the Google Cloud console or the gcloud CLI.
Console
-
In the Google Cloud console, go to the Colab Enterprise Execution jobs page.
-
Select the notebook run that you want to delete the result for.
-
Click Delete.
-
To confirm the deletion, click Confirm.
gcloud
Before using any of the command data below, make the following replacements:
- NOTEBOOK_RUN_ID: the ID of the notebook run that you want to delete.
- PROJECT_ID: your project ID.
- REGION: the region where your notebook run is located.
Execute the following command:
Linux, macOS, or Cloud Shell

    gcloud colab executions delete NOTEBOOK_RUN_ID \
        --project=PROJECT_ID \
        --region=REGION

Windows (PowerShell)

    gcloud colab executions delete NOTEBOOK_RUN_ID `
        --project=PROJECT_ID `
        --region=REGION

Windows (cmd.exe)

    gcloud colab executions delete NOTEBOOK_RUN_ID ^
        --project=PROJECT_ID ^
        --region=REGION
For more information about deleting Colab Enterprise notebook runs from the command line, see the gcloud CLI documentation.
Share a notebook run's results
You can share notebook run results by providing access to the Cloud Storage bucket that contains your notebook run. Providing this access also grants users access to any other resources in the same Cloud Storage bucket (see Security considerations).
For more information, see the Cloud Storage Sharing and collaboration page.
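As one possible way to provide that access, the sketch below builds a `gcloud storage buckets add-iam-policy-binding` command that grants a user read access to objects in the output bucket. It rests on assumptions: `build_share_command` is a hypothetical helper, and the Storage Object Viewer role (roles/storage.objectViewer) is only an example of a read-only role, so choose the role that matches your sharing policy.

```python
import shlex


def build_share_command(
    bucket: str,
    user_email: str,
    role: str = "roles/storage.objectViewer",  # Assumption: read-only access is enough
) -> list[str]:
    """Build a command that grants a user a role on the output bucket."""
    if bucket.startswith("gs://"):
        bucket = bucket[len("gs://"):]
    return [
        "gcloud", "storage", "buckets", "add-iam-policy-binding", f"gs://{bucket}",
        f"--member=user:{user_email}",
        f"--role={role}",
    ]


if __name__ == "__main__":
    # Print the command for review instead of running it.
    print(shlex.join(build_share_command("my-bucket", "teammate@example.com")))
```

Because the grant applies to the whole bucket, the user can also read any other objects stored there.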
Security considerations
Your notebook run results are stored as notebook (IPYNB) files in a Cloud Storage bucket. Consider the following when you grant access to this bucket:
- Anyone with access to the bucket can see the notebook file's code and the results of the notebook run.
- Anyone with the ability to change the contents of the bucket can change the contents of the notebook file.
- When your schedule is configured to use personal credentials, only the specified user can modify or trigger the schedule.
- When your schedule is configured to use a service account, only users with the iam.serviceAccounts.actAs permission on the service account can modify or trigger the schedule.
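Because the result file includes both code and outputs, you might prefer to share a copy with the outputs removed. The following sketch clears outputs from a downloaded IPYNB file using only the standard library. It assumes the file follows the nbformat 4 layout (a top-level `cells` list whose code cells carry `outputs` and `execution_count`), and `strip_outputs` is a hypothetical helper rather than a Colab Enterprise feature.

```python
import json


def strip_outputs(notebook_json: str) -> str:
    """Return the notebook JSON with code-cell outputs and execution counts cleared."""
    notebook = json.loads(notebook_json)
    for cell in notebook.get("cells", []):
        if cell.get("cell_type") == "code":
            cell["outputs"] = []
            cell["execution_count"] = None
    return json.dumps(notebook, indent=1)


if __name__ == "__main__":
    # Placeholder file names for a locally downloaded result and its cleaned copy.
    with open("result.ipynb", encoding="utf-8") as source:
        cleaned = strip_outputs(source.read())
    with open("result-no-outputs.ipynb", "w", encoding="utf-8") as target:
        target.write(cleaned)
```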
View schedule details
You can view information about a schedule, including:
- The Cloud Storage bucket that the schedule stores results in.
- The start and end time.
- The frequency.
To view schedule details, you can use the Google Cloud console or the gcloud CLI.
Console
-
In the Google Cloud console, go to the Colab Enterprise Schedules page.
-
Click the name of a schedule.
The Schedule details page opens.
-
To go back to the Schedules page, click Back to previous page.
gcloud
Before using any of the command data below, make the following replacements:
- SCHEDULE: your schedule ID.
- PROJECT_ID: your project ID.
- REGION: the region where your schedule is located.
Execute the following command:
Linux, macOS, or Cloud Shell

    gcloud colab schedules describe SCHEDULE \
        --project=PROJECT_ID \
        --region=REGION

Windows (PowerShell)

    gcloud colab schedules describe SCHEDULE `
        --project=PROJECT_ID `
        --region=REGION

Windows (cmd.exe)

    gcloud colab schedules describe SCHEDULE ^
        --project=PROJECT_ID ^
        --region=REGION
For more information about viewing Colab Enterprise schedules from the command line, see the gcloud CLI documentation.
Pause, resume, or delete a schedule
To pause, resume, or delete a schedule, you can use the Google Cloud console or the gcloud CLI.
Console
-
In the Google Cloud console, go to the Colab Enterprise Schedules page.
-
Select a schedule.
-
Click Pause, Resume, or Delete.
gcloud
Before using any of the command data below, make the following replacements:
- ACTION: one of pause, resume, or delete.
- SCHEDULE_ID: your schedule ID.
- PROJECT_ID: your project ID.
- REGION: the region where your schedule is located.
Execute the following command:
Linux, macOS, or Cloud Shell

    gcloud colab schedules ACTION SCHEDULE_ID \
        --project=PROJECT_ID \
        --region=REGION

Windows (PowerShell)

    gcloud colab schedules ACTION SCHEDULE_ID `
        --project=PROJECT_ID `
        --region=REGION

Windows (cmd.exe)

    gcloud colab schedules ACTION SCHEDULE_ID ^
        --project=PROJECT_ID ^
        --region=REGION
For more information about managing Colab Enterprise schedules from the command line, see the gcloud CLI documentation.
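When you manage schedules from a script, it helps to validate the action before calling gcloud. The sketch below is a minimal example, assuming the gcloud CLI is installed and authenticated; `build_schedule_command` and `manage_schedule` are hypothetical helper names.

```python
import subprocess

ALLOWED_ACTIONS = ("pause", "resume", "delete")


def build_schedule_command(action: str, schedule_id: str, project_id: str, region: str) -> list[str]:
    """Build `gcloud colab schedules ACTION SCHEDULE_ID` after checking the action."""
    if action not in ALLOWED_ACTIONS:
        raise ValueError(f"Action must be one of {ALLOWED_ACTIONS}, got {action!r}")
    return [
        "gcloud", "colab", "schedules", action, schedule_id,
        f"--project={project_id}",
        f"--region={region}",
    ]


def manage_schedule(action: str, schedule_id: str, project_id: str, region: str) -> None:
    """Run the command and raise CalledProcessError if gcloud exits with an error."""
    subprocess.run(build_schedule_command(action, schedule_id, project_id, region), check=True)
```

If gcloud asks for confirmation, for example before a deletion, the global `--quiet` flag suppresses interactive prompts. Whether a prompt appears for a given action is something to verify in your environment.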
What's next
Learn more about runtimes and runtime templates.
Learn how to create a runtime template.
Learn more about accessing Google Cloud services and APIs in your notebook.