By default, Cloud Data Fusion executes pipelines using a
Dataproc cluster in your project. Ensure that your project meets the
Dataproc networking requirements.
New projects start with a default network.
The default network is pre-populated with the default-allow-ssh firewall rule,
which allows incoming connections on TCP port 22 from any source to any instance
in the network. In versions earlier than 6.2.0, if no rule allowing ingress on TCP
port 22 exists in the network used by your Cloud Data Fusion instance, you must create one.
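As a rough illustration of the firewall check described above, the following sketch tests whether a list of firewall rules includes one that allows ingress on TCP port 22. The rule dictionaries loosely follow the shape of Compute Engine firewall resources (`direction`, `disabled`, `allowed`, `IPProtocol`, `ports`). The function names, the sample data, and the simplifications (source ranges, target tags, priorities, and deny rules are ignored) are assumptions made for this example and are not part of Cloud Data Fusion.

```python
def port_in_spec(port, spec):
    """Return True if `port` falls inside a port spec such as "22" or "20-25"."""
    low, _, high = spec.partition("-")
    return int(low) <= port <= int(high or low)


def allows_ingress_tcp(rules, port=22):
    """Return True if any enabled ingress rule allows TCP traffic on `port`.

    Simplified on purpose: source ranges, target tags, priorities, and
    deny rules are not evaluated.
    """
    for rule in rules:
        # Rules without an explicit direction are treated as ingress rules.
        if rule.get("direction", "INGRESS") != "INGRESS" or rule.get("disabled"):
            continue
        for allowed in rule.get("allowed", []):
            if allowed.get("IPProtocol") not in ("tcp", "all"):
                continue
            ports = allowed.get("ports")
            # A missing or empty port list means every port of that protocol.
            if not ports or any(port_in_spec(port, p) for p in ports):
                return True
    return False


# Hypothetical sample data standing in for rules fetched from a network.
sample_rules = [
    {"name": "default-allow-ssh", "direction": "INGRESS",
     "allowed": [{"IPProtocol": "tcp", "ports": ["22"]}]},
]
print(allows_ingress_tcp(sample_rules))
```

If the function returns False for the network that your instance uses and the instance version is earlier than 6.2.0, that is the case in which the rule has to be created.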
Create an instance
Caution: After you create a Cloud Data Fusion instance, you cannot change its edition.
Console
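If the API is enabled, the Cloud Data Fusion section in the Google Cloud console shows an Instances page, where you can create and manage your Cloud Data Fusion instances.
In the console, go to the Cloud Data Fusion page.
Click Instances, and then click Create an instance.
Enter an Instance name.
Enter a Description for your instance.
Specify the Region in which to create the instance.
Specify the Cloud Data Fusion Version.
Select an Edition. The default is Basic.
Optional: Click Add accelerators to select accelerators to use in your instance.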
Specify the Dataproc service account to use for running your Cloud Data Fusion pipelines in Dataproc. The default is the Compute Engine default service account. Regardless of version, make sure that the service account has the appropriate Identity and Access Management roles for your needs. For more information, see Granting service account user permission.
Optional: Specify any advanced options. If you don't specify anything, the following defaults are used:
Resource labels: Labels for the instance, used to annotate the related underlying resources, such as Compute Engine VMs. Label keys and label values can contain only letters, numbers, dashes, and underscores. Label keys must start with a letter or number. Default: None.
Maintenance
Enable maintenance window: Set the time period during which Cloud Data Fusion can perform maintenance operations on the instance. For more information, see Configuring the maintenance window. Default: Off.
Click Create. The instance creation process takes up to 30 minutes to complete.
While Cloud Data Fusion creates your instance, a progress wheel is displayed next to the instance name on the Instances page. After creation completes, the wheel turns into a green check mark, indicating that you can start using the instance.
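Because creation can take up to 30 minutes, a script that creates instances usually has to wait for completion rather than watch the progress wheel. The following is a minimal polling sketch, assuming a caller-supplied `get_state` function that returns the instance state as a string. The state names `CREATING`, `ACTIVE`, and `FAILED` are assumed to match the states reported by the Cloud Data Fusion REST API and should be checked against its reference; the function name, the polling interval, and the stand-in state source are hypothetical.

```python
import time


def wait_until_active(get_state, timeout_s=30 * 60, interval_s=30,
                      sleep=time.sleep, clock=time.monotonic):
    """Poll `get_state()` until it returns "ACTIVE".

    Raises RuntimeError if the state becomes "FAILED" and TimeoutError
    if the instance is not active within `timeout_s` seconds.
    """
    deadline = clock() + timeout_s
    while True:
        state = get_state()
        if state == "ACTIVE":
            return state
        if state == "FAILED":
            raise RuntimeError("instance creation failed")
        if clock() >= deadline:
            raise TimeoutError(f"instance still {state} after {timeout_s} s")
        sleep(interval_s)


# Usage with a stand-in state source; a real script would query the API here.
states = iter(["CREATING", "CREATING", "ACTIVE"])
print(wait_until_active(lambda: next(states), sleep=lambda seconds: None))
```

Passing `sleep` and `clock` as parameters keeps the loop testable without real waiting; the 30-minute default timeout mirrors the upper bound stated above.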
After you create an instance, you cannot add or remove the options for Cloud Logging and Cloud Monitoring or Private IP. To update these options with the Cloud Data Fusion API, call the PATCH method.
API
To create a Cloud Data Fusion instance with the Cloud Data Fusion REST API, construct an instances.create API request, filling in the Instance resource with your configuration information.
What's next
Learn more about Cloud Data Fusion.
Work through a tutorial.
Last updated 2025-09-04 UTC.
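To make the `instances.create` request described in the API section more concrete, the following sketch assembles the URL and JSON body for such a call without sending it. The endpoint pattern and the `type`, `description`, and `labels` fields reflect the public v1 REST reference as understood here and should be verified against it; the helper names, project, region, and instance ID are placeholders. The label check implements only the rules stated on this page, and restricting letters to ASCII is an additional assumption.

```python
import json
import re
from urllib.parse import urlencode

# Rules from this page: keys and values contain only letters, numbers, dashes,
# and underscores; keys start with a letter or number. ASCII-only is assumed.
LABEL_KEY = re.compile(r"[A-Za-z0-9][A-Za-z0-9_-]*")
LABEL_VALUE = re.compile(r"[A-Za-z0-9_-]*")


def validate_labels(labels):
    """Return `labels` unchanged, or raise ValueError on the first bad entry."""
    for key, value in labels.items():
        if not LABEL_KEY.fullmatch(key):
            raise ValueError(f"invalid label key: {key!r}")
        if not LABEL_VALUE.fullmatch(value):
            raise ValueError(f"invalid label value: {value!r}")
    return labels


def build_create_request(project, region, instance_id, edition="BASIC",
                         description="", labels=None):
    """Return (url, body) for an instances.create request; nothing is sent."""
    parent = f"projects/{project}/locations/{region}"
    url = (f"https://datafusion.googleapis.com/v1/{parent}/instances?"
           + urlencode({"instanceId": instance_id}))
    body = {"type": edition, "description": description,
            "labels": validate_labels(labels or {})}
    return url, json.dumps(body)


# Placeholder values; replace them with your own project, region, and ID.
url, body = build_create_request("my-project", "us-central1", "my-instance",
                                 labels={"team": "data-eng"})
print(url)
print(body)
```

Sending the request requires an authenticated HTTP POST with a valid access token, which this sketch leaves out on purpose.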