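The source database is the production operational database that is replicated to the target database. It can be located on-premises or on Google Cloud. Cloud Data Fusion Replication supports MySQL, Microsoft SQL Server, and Oracle source databases.
Change tracking solution
Cloud Data Fusion does not run an agent on the source database. Instead, it relies on a change tracking solution to read the changes in the source database. The solution can be a component of the source database or a separately licensed third-party solution. In the latter case, the change tracking solution runs on-premises, collocated with the source database, or on Google Cloud. Each source must be associated with a change tracking solution.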
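As a rough illustration of what reading changes from a change tracking solution makes possible, the following sketch applies an ordered stream of change events to a copy of an initial snapshot. It is not Cloud Data Fusion code: the `ChangeEvent` type, the `apply_changes` function, and the in-memory dictionaries standing in for source and target tables are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass
from typing import Any, Dict, Iterable, Optional


@dataclass(frozen=True)
class ChangeEvent:
    """Hypothetical change record; not a Cloud Data Fusion type."""
    op: str                               # "insert", "update", or "delete"
    key: int                              # primary key of the affected row
    row: Optional[Dict[str, Any]] = None  # new row values; None for deletes


def apply_changes(target: Dict[int, Dict[str, Any]],
                  events: Iterable[ChangeEvent]) -> Dict[int, Dict[str, Any]]:
    """Apply change events, in order, to an in-memory target table."""
    for event in events:
        if event.op in ("insert", "update"):
            if event.row is None:
                raise ValueError(f"{event.op} event for key {event.key} has no row")
            # Inserts and updates both overwrite the row stored under the key.
            target[event.key] = dict(event.row)
        elif event.op == "delete":
            # Deleting a key that is already absent is treated as a no-op.
            target.pop(event.key, None)
        else:
            raise ValueError(f"unknown operation: {event.op!r}")
    return target


# Assumed example data: an initial snapshot, then changes read from the log.
snapshot = {1: {"name": "Ada"}, 2: {"name": "Grace"}}
changes = [
    ChangeEvent("update", 2, {"name": "Grace H."}),
    ChangeEvent("insert", 3, {"name": "Edsger"}),
    ChangeEvent("delete", 1),
]
replica = apply_changes(dict(snapshot), changes)
print(replica)
```

Only the three changed rows are processed here, rather than the whole table being re-read, which is the general idea behind log-based change tracking.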
# Replication overview

Cloud Data Fusion Replication lets you create copies of
your data continuously and in real time from operational datastores, such as SQL
Server and MySQL, into [BigQuery](/bigquery/docs).

To use Replication, do one of the following:

- Create a new instance of Cloud Data Fusion and add the Replication app.
- Add the Replication app to an existing instance.

Benefits include:

- Identifying schema incompatibilities, connectivity issues, and missing
  features before replication starts, and providing corrective actions.

- Using the latest operational data in real time for analysis within
  BigQuery. You use log-based replication directly into
  BigQuery from Microsoft SQL Server (using
  [SQL Server CDC](https://docs.microsoft.com/en-us/sql/relational-databases/track-changes/about-change-data-capture-sql-server?view))
  and MySQL (using [MySQL Binary Log](https://dev.mysql.com/doc/refman/8.0/en/binary-log.html)).

- Change data capture (CDC), which provides a representation of changed data
  as a stream, so that computations and processing can focus on
  the most recently changed records. This minimizes outbound data charges on
  sensitive production systems.

- Enterprise scalability that supports high-volume transactional databases. Initial
  loads of data to BigQuery are supported with zero-downtime
  snapshot replication, to make the data warehouse ready for consuming changes
  continuously. Once the initial snapshot is done, high-throughput, continuous
  replication of changes starts in real time.

- Dashboards that give you real-time insight into replication performance.
  They're useful for identifying bottlenecks and monitoring data delivery SLAs.

- Support for Data Residency, Customer-Managed Encryption Keys (CMEK),
  and VPC Service Controls. Integration of Cloud Data Fusion within
  Google Cloud ensures that the highest levels of enterprise security and
  privacy are observed while making the latest data available in your data
  warehouse for analytics.

Recommended pricing
-------------------

When Replication runs, you're charged for the Dataproc
cluster and you incur processing costs for BigQuery. To optimize
these costs, we strongly recommend that you use [BigQuery flat
rate pricing](/bigquery/pricing#flat_rate_pricing).

For more information, see the Cloud Data Fusion
[Pricing](/data-fusion/pricing) page.

Replication entities
--------------------

Actions
-------

Monitoring
----------

Table states
------------

Metrics
-------

Components
----------

### Connectivity

The following table describes the network connections required for
Replication, and the security mechanisms they use.

**Note:** Replication inherits the networking and security capabilities of Cloud Data Fusion, such as [Private IP](/data-fusion/docs/how-to/create-private-ip), [VPC-SC](/data-fusion/docs/how-to/using-vpc-sc), and [CMEK](/data-fusion/docs/how-to/customer-managed-encryption-keys).

What's next
-----------

- Refer to the [Replication API reference](/data-fusion/docs/reference/replication-ref).
- Refer to the [data type mappings for Replication](/data-fusion/docs/reference/replication-data-types).

Last updated 2025-09-04 UTC.