Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.
Last updated 2025/09/04 (UTC).
# Reuse clusters

This page describes how to reuse Dataproc clusters for your
pipeline runs in Cloud Data Fusion. For more information, see [When to
reuse clusters](/data-fusion/docs/concepts/configure-clusters#reuse-clusters)
and [Run a pipeline against an existing Dataproc
cluster](/data-fusion/docs/how-to/running-against-existing-dataproc).

Before you begin
----------------

- You must have a Cloud Data Fusion instance in version 6.5.0 or later.

Enable cluster reuse
--------------------

You can reuse clusters in a new compute profile, or in one that's been used in a
deployed pipeline.

### Enable cluster reuse in a new profile

1. Go to your instance:

   1. In the Google Cloud console, go to the Cloud Data Fusion page.

   2. To open the instance in the Cloud Data Fusion Studio,
      click **Instances**, and then click **View instance**.

      [Go to Instances](https://console.cloud.google.com/data-fusion/locations/-/instances)

2. Click **System admin > Configuration > System compute profiles**.

3. Click **Create new profile**.

4. Choose the **Dataproc** provisioner.

5. In the **Create a profile for Dataproc** window, enter the
   details about your cluster:

   1. In the **Profile label** and **Profile name** fields, enter a name to identify the profile, for example, `execution_compute-profile`.
   2. In the **Description** field, describe the purpose of the profile, for example, `Profile used for pipeline execution`.
   3. In the **Max idle time** field, enter a value. For more information, see [Set max idle time](/data-fusion/docs/troubleshoot-deleting-clusters#set_max_idle_time).
   4. Set the **Skip cluster delete** field to `True`. For more information, see [When to reuse clusters](/data-fusion/docs/concepts/configure-clusters#reuse-clusters).
   5. Optional: configure other optional fields.
   6. Click **Create**.

### Enable cluster reuse in a deployed pipeline

1. Go to your instance:

   1. In the Google Cloud console, go to the Cloud Data Fusion page.

   2. To open the instance in the Cloud Data Fusion Studio,
      click **Instances**, and then click **View instance**.

      [Go to Instances](https://console.cloud.google.com/data-fusion/locations/-/instances)

2. Click **List**.

3. Click the **Deployed** tab and click a pipeline name. The deployed pipeline
   opens on the **Studio** page in the Cloud Data Fusion web interface.

4. Click **Configure**.

5. In the **Compute config** window, go to the chosen profile and click
   **Customize**.

6. In the window that opens, enter the following values:

   1. In the **Max Idle Time** field, enter a value. For more information, see [Set max idle time](/data-fusion/docs/troubleshoot-deleting-clusters#set_max_idle_time).
   2. Set **Skip cluster delete** to `True`. For more information, see [When to reuse clusters](/data-fusion/docs/concepts/configure-clusters#reuse-clusters).

7. Click **Done**.

What's next
-----------

- Learn more about [configuring clusters](/data-fusion/docs/concepts/configure-clusters).
- Troubleshoot [deleting clusters](/data-fusion/docs/troubleshoot-deleting-clusters).
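
As a recap of the two procedures on this page, both come down to the same pair of settings: **Skip cluster delete** set to `True`, and a value entered for the max idle time. The following Python sketch models that pair as a small checklist. It is illustrative only: `ProfileSettings`, its field names, and `reuse_problems` are hypothetical and are not part of any Cloud Data Fusion or Dataproc API. Treating a non-positive idle time as "not set" is also an assumption, and the value `30` is an arbitrary example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ProfileSettings:
    """Hypothetical stand-in for the two fields set in the steps on this page."""

    skip_cluster_delete: bool  # mirrors the "Skip cluster delete" field
    max_idle_time: int  # mirrors the max idle time field, in whatever unit it expects


def reuse_problems(settings: ProfileSettings) -> List[str]:
    """Return the reasons why the settings do not match the steps on this page."""
    problems = []
    if not settings.skip_cluster_delete:
        problems.append("Skip cluster delete is not set to True")
    # Assumption: a value of zero or less is treated as "no value entered".
    if settings.max_idle_time <= 0:
        problems.append("Max idle time has no positive value")
    return problems


if __name__ == "__main__":
    # Arbitrary example values, not recommendations.
    print(reuse_problems(ProfileSettings(skip_cluster_delete=True, max_idle_time=30)))
    print(reuse_problems(ProfileSettings(skip_cluster_delete=False, max_idle_time=0)))
```

A checklist like this can serve as a personal review aid before you click **Create** or **Done**. It does not read or change any real profile.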