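The production operational database to replicate into the target database. This database can be located on-premises or on Google Cloud. Cloud Data Fusion Replication supports MySQL, Microsoft SQL Server, and Oracle source databases.
Change tracking solution
Rather than running on an agent that runs on the source database, Cloud Data Fusion relies on a change tracking solution to read the changes in the source database. The solution can be a component of the source database or a separately licensed third-party solution. In the latter case, the change tracking solution runs on-premises, co-located with the source database, or on Google Cloud. Each source must be associated with a change tracking solution.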
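As a rough illustration of what reading changes from a change tracking solution amounts to, the following Python sketch applies a stream of change events to an in-memory copy of a table. It is a conceptual model only, not the Cloud Data Fusion API: `ChangeEvent`, `apply_changes`, and the event fields are hypothetical names chosen for this example, and a real change tracking solution (such as SQL Server CDC or the MySQL binary log) emits events in its own format.

```python
from dataclasses import dataclass
from typing import Any, Dict, Iterable, Optional


@dataclass(frozen=True)
class ChangeEvent:
    """Hypothetical change record: one row-level change read from the source."""
    op: str                               # "insert", "update", or "delete"
    key: int                              # primary key of the affected row
    row: Optional[Dict[str, Any]] = None  # new row contents (None for deletes)


def apply_changes(target: Dict[int, Dict[str, Any]],
                  events: Iterable[ChangeEvent]) -> Dict[int, Dict[str, Any]]:
    """Apply change events, in order, to a target table keyed by primary key."""
    for event in events:
        if event.op in ("insert", "update"):
            if event.row is None:
                raise ValueError(f"{event.op} event for key {event.key} has no row")
            target[event.key] = dict(event.row)
        elif event.op == "delete":
            target.pop(event.key, None)
        else:
            raise ValueError(f"unknown operation: {event.op}")
    return target


# Initial snapshot of the source table, followed by changes captured afterwards.
snapshot = {1: {"name": "Ada"}, 2: {"name": "Grace"}}
changes = [
    ChangeEvent("update", 1, {"name": "Ada L."}),
    ChangeEvent("delete", 2),
    ChangeEvent("insert", 3, {"name": "Edsger"}),
]

replica = apply_changes(dict(snapshot), changes)
print(replica)  # {1: {'name': 'Ada L.'}, 3: {'name': 'Edsger'}}
```

The sketch shows only the changed rows being moved after the initial snapshot, which is why change tracking avoids repeatedly rereading the whole source table.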
Last updated (UTC): 2025-09-04.

# Replication overview

Cloud Data Fusion Replication lets you create copies of
your data continuously and in real time from operational datastores, such as SQL
Server and MySQL, into [BigQuery](/bigquery/docs).

To use Replication, choose one of the following ways:

- Create a new instance of Cloud Data Fusion and add the Replication app.
- Add the Replication app to an existing instance.

Benefits include:

- Identifying schema incompatibilities, connectivity issues, and missing
  features before replication starts, and providing corrective actions.

- Using the latest operational data in real time for analysis within
  BigQuery. You use log-based replication directly into
  BigQuery from Microsoft SQL Server (using
  [SQL Server CDC](https://docs.microsoft.com/en-us/sql/relational-databases/track-changes/about-change-data-capture-sql-server?view))
  and MySQL (using [MySQL Binary Log](https://dev.mysql.com/doc/refman/8.0/en/binary-log.html)).

- Change data capture (CDC) providing a representation of data that has changed
  in a stream, allowing computations and processing to focus specifically on
  the most recently changed records. This minimizes outbound data charges on
  sensitive production systems.

- Enterprise scalability supporting high-volume transactional databases. Initial
  loads of data to BigQuery are supported with zero-downtime
  snapshot replication, to make the data warehouse ready for consuming changes
  continuously. Once the initial snapshot is done, high-throughput, continuous
  replication of changes starts in real time.

- Dashboards helping you get real-time insights into replication performance.
  They're useful for identifying bottlenecks and monitoring data delivery SLAs.

- Including support for Data Residency, Customer-Managed Encryption Keys (CMEK),
  and VPC Service Controls. Integration of Cloud Data Fusion within
  Google Cloud ensures that the highest levels of enterprise security and
  privacy are observed while making the latest data available in your data
  warehouse for analytics.

Recommended pricing
-------------------

When Replication runs, you're charged for the Dataproc
cluster and you incur processing costs for BigQuery. To optimize
these costs, we strongly recommend that you use [BigQuery flat
rate pricing](/bigquery/pricing#flat_rate_pricing).

For more information, see the Cloud Data Fusion
[Pricing](/data-fusion/pricing) page.

Replication entities
--------------------

Actions
-------

Monitoring
----------

Table states
------------

Metrics
-------

Components
----------

### Connectivity

The following table describes the network connections required for
Replication, and the security mechanisms they use.

**Note:** Replication inherits the networking and security capabilities of Cloud Data Fusion, such as [Private IP](/data-fusion/docs/how-to/create-private-ip), [VPC-SC](/data-fusion/docs/how-to/using-vpc-sc), and [CMEK](/data-fusion/docs/how-to/customer-managed-encryption-keys).

What's next
-----------

- Refer to the [Replication API reference](/data-fusion/docs/reference/replication-ref).
- Refer to the [data type mappings for Replication](/data-fusion/docs/reference/replication-data-types).