기본적으로 Cloud Data Fusion은 프로젝트의 Dataproc 클러스터를 사용하여 파이프라인을 실행합니다. 프로젝트가 Dataproc 네트워킹 요구사항을 충족하는지 확인합니다.
새 프로젝트는 기본 네트워크로 시작합니다.
기본 네트워크는 default-allow-ssh 방화벽 규칙으로 자동 입력되어 TCP 포트 22를 통해 모든 소스에서 네트워크의 모든 인스턴스로 들어오는 연결을 허용합니다. 6.2.0 이하 버전에서 TCP 포트 22의 인그레스를 허용하는 이 규칙이 Cloud Data Fusion 인스턴스에서 사용하는 네트워크에 존재하지 않는 경우 이 규칙을 만들어야 합니다.
인스턴스 만들기
콘솔
API가 사용 설정된 경우 Google Cloud 콘솔의 Cloud Data Fusion 섹션에 Cloud Data Fusion 인스턴스를 관리할 수 있는 인스턴스 페이지가 표시됩니다.
Dataproc에서 Cloud Data Fusion 파이프라인을 실행하는 데 사용할 Dataproc 서비스 계정을 지정합니다. 기본값은 Compute Engine 계정입니다. 버전에 관계없이 서비스 계정에 필요에 맞는 적절한 ID 및 액세스 관리 역할이 있는지 확인합니다. 자세한 내용은 서비스 계정 사용자 권한 부여를 참조하세요.
[[["이해하기 쉬움","easyToUnderstand","thumb-up"],["문제가 해결됨","solvedMyProblem","thumb-up"],["기타","otherUp","thumb-up"]],[["이해하기 어려움","hardToUnderstand","thumb-down"],["잘못된 정보 또는 샘플 코드","incorrectInformationOrSampleCode","thumb-down"],["필요한 정보/샘플이 없음","missingTheInformationSamplesINeed","thumb-down"],["번역 문제","translationIssue","thumb-down"],["기타","otherDown","thumb-down"]],["최종 업데이트: 2025-09-04(UTC)"],[[["\u003cp\u003eThis page details the process of creating a Cloud Data Fusion instance, covering prerequisites, console setup, and API methods.\u003c/p\u003e\n"],["\u003cp\u003eBefore creating an instance, users must enable the Cloud Data Fusion API and ensure they have the necessary \u003ccode\u003einstances.create\u003c/code\u003e permission.\u003c/p\u003e\n"],["\u003cp\u003eWhen creating an instance, users can specify various configurations, such as the instance name, region, version, edition, and advanced options like private IP and logging.\u003c/p\u003e\n"],["\u003cp\u003eInstance creation can be done via the Google Cloud console, navigating to the Cloud Data Fusion Instances page and selecting "Create an instance," or via a REST API request.\u003c/p\u003e\n"],["\u003cp\u003eAfter an instance is created, certain advanced options like Cloud Logging and Private IP cannot be added or removed and must be handled through an API \u003ccode\u003ePATCH\u003c/code\u003e method.\u003c/p\u003e\n"]]],[],null,["# Create a public instance\n\nThis page explains how to create a Cloud Data Fusion\n[instance](/data-fusion/docs/concepts/overview).\n\nBefore you begin\n----------------\n\n- [Enable the Cloud Data Fusion API](/data-fusion/docs/how-to/enable-service).\n- The following permission is required to create Cloud Data Fusion instances: `instances.create`. For more information, see [Access control](/data-fusion/docs/access-control).\n- Cloud Data Fusion instances run as the [Compute Engine\n default service account](/compute/docs/access/service-accounts#compute_engine_default_service_account). For information about the types and roles available, see [Service accounts](/iam/docs/service-account-types#google-managed_service_accounts).\n- By default, Cloud Data Fusion executes pipelines using a Dataproc cluster in your project. Ensure that your project meets the [Dataproc networking requirements](/dataproc/docs/concepts/configuring-clusters/network).\n- New projects start with a [default network](/vpc/docs/vpc#default-network). The default network is pre-populated with a firewall rule [default-allow-ssh](/vpc/docs/firewalls#more_rules_default_vpc), that allows incoming connections on TCP port 22 from any source to any instance in the network. In versions earlier than 6.2.0, if this rule to allow ingress on TCP port 22 doesn't exist in the network used by your Cloud Data Fusion instance, you must [create this rule](/vpc/docs/using-firewalls#creating_firewall_rules).\n\nCreate an instance\n------------------\n\n**Caution:** After you create a Cloud Data Fusion instance, you cannot change its edition. \n\n### Console\n\nIf the [API is enabled](/data-fusion/docs/how-to/enable-service), the\nCloud Data Fusion section in the Google Cloud console shows an\n**Instances** page, where you can create and manage your\nCloud Data Fusion instances.\n\n1. In the console, go to the Cloud Data Fusion page.\n2. Click **Instances** , and then click **Create an instance** .\n\n [Create an instance](https://console.cloud.google.com/data-fusion/instance-create)\n3. Enter an **Instance name**.\n4. Enter a **Description** for your instance.\n5. Specify the **Region** in which to create the instance.\n6. Specify the Cloud Data Fusion [**Version**](/data-fusion/docs/release-notes).\n7. Select an [**Edition**](/data-fusion/pricing). The default is **Basic**.\n8. Optional: Click **Add accelerators** to select accelerators to use in your instance.\n9. Specify the [**Dataproc service account**](/dataproc/docs/concepts/configuring-clusters/service-accounts) to use for running your Cloud Data Fusion pipeline in Dataproc. The default is the Compute Engine account. Regardless of version, make sure that a service account has appropriate Identity and Access Management roles for your needs. For more information, see [Granting service account user permission](/data-fusion/docs/how-to/granting-service-account-permission).\n10. Optional: Specify any advanced options. If you don't specify anything,\n the following defaults are used:\n\n11. Click **Create**. It takes up to 30 minutes for the instance\n creation process to complete.\n\nWhile Cloud Data Fusion creates your instance, a progress\nwheel displays next to the instance name on the **Instances** page. After\ncompletion, it turns into a green check mark and indicates that you can\nstart using the instance.\n| After you create an instance, you cannot add or remove the options for [Cloud Logging and Cloud Monitoring](/data-fusion/docs/how-to/viewing-stackdriver-logs#enabling-stackdriver) or [Private IP](/data-fusion/docs/how-to/create-private-ip). To update these options with the Cloud Data Fusion API, call the [`PATCH`](/data-fusion/docs/reference/rest/v1/projects.locations.instances/patch) method.\n\n### API\n\nCreate an instance:\n\nTo create a Cloud Data Fusion instance with the\n[Cloud Data Fusion REST API](/data-fusion/docs/reference/rest),\nconstruct an\n[`instances.create`](/data-fusion/docs/reference/rest/v1/projects.locations.instances/create)\nAPI request, filling in the\n[`Instance`](/data-fusion/docs/reference/rest/v1/projects.locations.instances#Instance)\nresource with your configuration information.\n\nWhat's next\n-----------\n\n- Learn more about [Cloud Data Fusion](/data-fusion/docs/concepts/overview).\n- Work through a [tutorial](/data-fusion/docs/tutorials/targeting-campaign-pipeline)."]]