Configure an Amazon RDS PostgreSQL database

The following page covers how to configure CDC for an Amazon RDS for PostgreSQL database.

Create a parameter group

  1. Launch your Amazon RDS Dashboard.

  2. In the Navigation Drawer, click Parameter Groups, and then click Create Parameter Group. The Create Parameter Group page appears.

  3. Select the database family that matches your database, provide a name and description for the parameter group, and then click Create.

  4. Select the checkbox to the left of your newly created parameter group, and then, under Parameter group actions, click Edit.

  5. Set the following parameter for your group.

    ParameterValue
    rds.logical_replication1
  6. Click Save Changes.

Configure the source database

  1. Launch your Amazon RDS Dashboard.

  2. In the Navigation Drawer, click Databases.

  3. Select your source, and then click Modify.

  4. Go to the Additional configuration section.

  5. Select the parameter group that you created.

  6. Click Continue.

  7. Under Scheduling of modifications, select Apply immediately.

Verify that the parameter group is assigned to the database instance

  1. Launch your Amazon RDS Dashboard.

  2. In the Navigation Drawer, click Databases, and then select your database instance.

  3. Click the Configurations tab.

  4. Verify that you see the parameter group that you created, and that its status is pending-reboot.

  5. Reboot your database instance to complete the configuration. To reboot the instance:

    1. In the Navigation Drawer, click Instances.
    2. Select your database instance.
    3. From the Instance Actions menu, select Reboot.

Create a publication and a replication slot

  1. Create a publication. We recommend that you create a publication only for the tables that you want to replicate. This allows Datastream to read-only the relevant data, and lowers the load on the database and Datastream:

    CREATE PUBLICATION PUBLICATION_NAME
    FOR TABLE SCHEMA1.TABLE1, SCHEMA2.TABLE2;

    Replace the following:

    • PUBLICATION_NAME: The name of your publication. You'll need to provide this name when you create a stream in the Datastream stream creation wizard.
    • SCHEMA: The name of the schema that contains the table.
    • TABLE: The name of the table that you want to replicate.

    You can create a publication for all tables in a schema. This approach lets you replicate changes for tables in the specified list of schemas, including tables that you create in the future:

    CREATE PUBLICATION PUBLICATION_NAME
    FOR TABLES IN SCHEMA SCHEMA1, SCHEMA2;

    You can also create a publication for all tables in your database. Note that this approach increases the load on both the source database and Datastream:

    CREATE PUBLICATION PUBLICATION_NAME FOR ALL TABLES;
    
  2. Create a replication slot by entering the following PostgreSQL command:

    SELECT PG_CREATE_LOGICAL_REPLICATION_SLOT('REPLICATION_SLOT_NAME', 'pgoutput'); 

    Replace the following:

    • REPLICATION_SLOT_NAME: The name of your replication slot. You'll need to provide this name when you create a stream in the Datastream stream creation wizard.

Create a Datastream user

  1. To create a Datastream user, enter the following PostgreSQL command:

    CREATE USER USER_NAME WITH ENCRYPTED PASSWORD 'USER_PASSWORD';
    

    Replace the following:

    • USER_NAME: The name of the Datastream user that you want to create.
    • USER_PASSWORD: The login password for the Datastream user that you want to create.
  2. Grant the following privileges to the user you created:

    GRANT RDS_REPLICATION TO USER_NAME;
    GRANT SELECT ON ALL TABLES IN SCHEMA SCHEMA_NAME TO USER_NAME;
    GRANT USAGE ON SCHEMA SCHEMA_NAME TO USER_NAME;
    ALTER DEFAULT PRIVILEGES IN SCHEMA SCHEMA_NAME
      GRANT SELECT ON TABLES TO USER_NAME;
    

    Replace the following:

    • USER_NAME: The user to whom you want to grant the privileges.
    • SCHEMA_NAME: The name of the schema to which you want to grant the privileges.

What's next