Use third party transfers

Third party transfers for BigQuery Data Transfer Service allow you to automatically schedule and manage recurring load jobs for external data sources such as Salesforce CRM, Adobe Analytics, and Facebook Ads.

Before you begin

Before you create a third party data transfer:

  • Verify that you have completed all actions required to enable the BigQuery Data Transfer Service.
  • Create a BigQuery dataset to store the data.
  • Ensure that the person creating the transfer has the following required permissions in BigQuery:

    • bigquery.transfers.update permissions to create the transfer
    • Both bigquery.datasets.get and bigquery.datasets.update permissions on the target dataset

    The bigquery.admin predefined IAM role includes bigquery.transfers.update, bigquery.datasets.update and bigquery.datasets.get permissions. For more information on IAM roles in BigQuery Data Transfer Service, see Access control.

  • Consult the documentation for your third party data source to ensure you have configured any permissions necessary to enable the transfer.

  • If you intend to setup transfer run notifications for Pub/Sub, you must have pubsub.topics.setIamPolicy permissions. Pub/Sub permissions are not required if you just set up email notifications. For more information, see BigQuery Data Transfer Service run notifications.

Limitations

Third party transfers are subject to the following limitations:

  • You must create or update a third party transfer by using the Google Cloud console.
  • Currently, you cannot configure or update a third party transfers by using the bq command-line tool.

Set up a third party data transfer

To create a third party data transfer by using the Google Cloud console:

  1. Go to the Google Cloud Marketplace.

    Go to the Google Cloud Marketplace

  2. Click the appropriate third party provider.

  3. On the documentation page for the third party provider, click Enroll. The enrollment process may take a moment.

  4. After the enrollment is complete, click Configure Transfer.

  5. On the Create Transfer page:

    • For Source, choose the appropriate third party data source. You can click Explore Data Sources to see the list of third party providers in the Google Cloud Marketplace.

      Transfer source

    • For Display name, enter a name for the transfer such as My Transfer. The transfer name can be any value that allows you to easily identify the transfer if you need to modify it later.

      Transfer name

    • For Schedule, leave the default value (Start now) or click Start at a set time.

      • For Repeats, choose an option for how often to run the transfer. Options include:

        • Daily (default)
        • Weekly
        • Monthly
        • Custom
        • On-demand

        If you choose an option other than Daily, additional options are available. For example, if you choose Weekly, an option appears for you to select the day of the week.

      • For Start date and run time, enter the date and time to start the transfer. If you choose Start now, this option is disabled.

        Transfer schedule

    • For Destination dataset, choose the dataset you created to store your data.

      Transfer dataset

    • (Optional) In the Notification options section:

      • Click the toggle to enable email notifications. When you enable this option, the transfer administrator receives an email notification when a transfer run fails.
      • For Select a Pub/Sub topic, choose your topic name or click Create a topic to create one. This option configures Pub/Sub run notifications for your transfer.
  6. Click Connect Source.

  7. When prompted, click Accept to give the BigQuery Data Transfer Service permission to connect to the data source and to manage your data in BigQuery.

  8. Follow the instructions in the subsequent pages to configure the connection to your external data source.

  9. After you complete the configuration steps, click Save.

Troubleshoot third party transfer setup

If you are having issues setting up your transfer, consult the appropriate third party vendor. Contact information is available on the transfer's documentation page in the Google Cloud Marketplace.

Query your data

When your data is transferred to BigQuery, the data is written to ingestion-time partitioned tables. For more information, see Introduction to partitioned tables.

If you query your tables directly instead of using the auto-generated views, you must use the _PARTITIONTIME pseudo-column in your query. For more information, see Querying partitioned tables.

What's next