Skip to content
This repository has been archived by the owner on Aug 10, 2023. It is now read-only.

Latest commit

 

History

History
183 lines (120 loc) · 9.15 KB

File metadata and controls

183 lines (120 loc) · 9.15 KB
title description author tags date_published
Delete idle Compute Engine instances automatically
Use a simple and scalable serverless mechanism to automatically delete Compute Engine instances that are not in active use.
jpatokal
garbage collection, Cloud Scheduler, Cloud Functions
2021-03-19

Jani Patokallio | Solutions Architect | Google

Contributed by Google employees.

This tutorial offers a simple and scalable serverless mechanism to automatically delete (garbage-collect) Compute Engine virtual machine (VM) instances that are marked as idle.

Some cases in which this may be useful:

  • Developers or testers create one-off VM instances for testing a feature, but they might not always remember to manually delete the instances.
  • Workflows can require dynamically starting a large number of Compute Engine worker instances to perform a certain task. A best practice is to have instances delete themselves after the task is complete, but ensuring that this always happens can be difficult if the task is distributed or some workers stop because of errors.

Idle VM recommendations are generated automatically based on system utilization over the past 14 days. They are not available for some types of instances, including VMs belonging to managed instance groups, managed services like Dataflow or Google Kubernetes Engine, or instances with local resources like local SSDs, GPUs, or TPUS. For the full list of limitations, see Viewing and applying idle VM recommendations.

How it works

The following diagram shows a high-level overview of the solution:

High-level overview of the solution

The overall flow is the following:

  1. A Cloud Scheduler cron job is triggered regularly (for example, once a day). The Cloud Scheduler configuration specifies the label of the pool of VM instances to target and whether or not they should be deleted, using the following format:

    '{"label":"env=test,action=mark"}'
    

    If the configuration is empty ({}), all VM instances are considered targets.

  2. When the cron job is triggered, Cloud Scheduler calls a Cloud Function with the payload.

  3. Each time the function is triggered, it does the following:

    1. Queries the Idle VM Recommender for a list of idle Compute Engine instances.
    2. Filters the list for instances that have the target label.
    3. Iterates through the instances and does the following:
      • If the Cloud Scheduler configuration includes the label action=stop, then the target is immediately stopped and the recommendation status is set to SUCCEEDED.
      • If the Cloud Scheduler configuration includes the label action=delete, then the target is immediately deleted and the recommendation status is set to SUCCEEDED.
      • Otherwise, the label delete=true is applied to the instance and the recommendation status is set to CLAIMED.

By default, potential idle instances are only labeled. You can obtain the list of labeled instances by checking recommendation status or searching for instances with the label.

Costs

This tutorial uses the following Google Cloud components:

  • Compute Engine
  • Cloud Scheduler
  • Cloud Functions
  • Recommender API

Use the pricing calculator to generate a cost estimate based on this projected usage.

Cloud Scheduler is free for up to 3 jobs per month.

New Google Cloud users may be eligible for a free trial.

Before you begin

The following steps create a new project and new VM instances, so idle VM instance recommendations may not be generated until 14 days of system metrics are available. For immediate results, you can deploy this in an existing project with idle instances instead.

  1. If you don’t already have one, create a Google Account.

  2. Create a Google Cloud project: In the Cloud Console, click Create project.

  3. Enable billing for the project.

  4. Open Cloud Shell.

  5. Create an App Engine app, which is required by Cloud Scheduler:

    gcloud app create --region=us-central
    
  6. Enable the APIs used by this tutorial:

    gcloud services enable appengine.googleapis.com cloudbuild.googleapis.com \
      cloudfunctions.googleapis.com cloudscheduler.googleapis.com compute.googleapis.com \
      recommender.googleapis.com
    

Set up the automated cleanup code

Run the commands in this section in Cloud Shell.

  1. Clone the GitHub repository:

    git clone https://github.com/GoogleCloudPlatform/community
    
  2. Go to the delete-idle-instances directory:

    cd community/tutorials/delete-idle-instances
    
  3. Create a service account for Cloud Scheduler to use:

    gcloud iam service-accounts create scheduler-sa --display-name "Cloud Scheduler service account"
    export SCHEDULER_SA=scheduler-sa@${GOOGLE_CLOUD_PROJECT}.iam.gserviceaccount.com
    
  4. Assign the IAM roles needed to allow this service account to invoke Cloud Functions and administer Compute Engine recommendations and instances:

    gcloud projects add-iam-policy-binding ${GOOGLE_CLOUD_PROJECT} \
      --member serviceAccount:${SCHEDULER_SA} \
      --role roles/cloudfunctions.invoker
    gcloud projects add-iam-policy-binding ${GOOGLE_CLOUD_PROJECT} \
      --member serviceAccount:${SCHEDULER_SA} \
      --role roles/compute.instanceAdmin
    gcloud projects add-iam-policy-binding ${GOOGLE_CLOUD_PROJECT} \
      --member serviceAccount:${SCHEDULER_SA} \
      --role roles/recommender.computeAdmin
    
  5. Deploy the Cloud Function that marks idle instances:

    gcloud functions deploy mark_idle_instances --trigger-http --region us-central1 --runtime=nodejs12 \
      --service-account ${SCHEDULER_SA} --entry-point=deleteIdleInstances --no-allow-unauthenticated
    

    Note: The Cloud Function runs with the default App Engine service account, which has project editor rights.

  6. Configure Cloud Scheduler to invoke the Cloud Function once per day:

    gcloud scheduler jobs create http mark-idle-instances-job --schedule="0 0 * * *" \
      --uri "https://us-central1-${GOOGLE_CLOUD_PROJECT}.cloudfunctions.net/mark_idle_instances" \
      --http-method POST --oidc-service-account-email ${SCHEDULER_SA} \
      --message-body='{"label":"env=test"}'
    

    The schedule is specified in unix-cron format. 0 0 * * * means that the jobs runs at 0:00 (midnight) UTC every day of the month, every month, and every day of the week. More simply put, the job runs once per day.

  7. Verify that the job has been created:

    gcloud scheduler jobs list
    

    Jobs also appear on the Cloud Scheduler page in the Cloud Console. On that page, you can view execution logs for each job by clicking View in the Logs column.

Test the automated tagging

  1. Create a test instance labeled env=test:

    gcloud compute instances create idle-test --zone=us-central1-a \
      --machine-type=n1-standard-1 --labels=env=test
    
  2. Check that the new instance has started successfully:

    gcloud compute instances list --format='table(name,status,labels.list())'
    
  3. Wait 14 days and run the same command again:

    gcloud compute instances list --format='table(name,status,labels.list())'
    

    The idle-test instance should be shown as RUNNING with the label delete=true now applied.

  4. List and review instances marked for deletion:

    gcloud compute instances list --filter='labels.delete=true' --format='value(name)'
    
  5. (Optional) Delete the instances that are marked for deletion:

    gcloud compute instances delete |\
    $(gcloud compute instances list --filter='labels.delete=true' --format='value(name)')
    

You can also see the Cloud Function execution results, including the names of the marked instances, by viewing the Cloud Function logs from the Cloud Functions page in the Cloud Console.

Shut down resources used in the tutorial

After you have tested the automated cleanup of VM instances, you can either delete the entire project or delete the individual resources that you created to prevent further billing for them on your account.