Create an Amazon Redshift Data Stream

Create a data stream to start the flow of data from your Amazon Redshift source to Data Cloud.

| User Permissions Needed  |                                                      |
| ------------------------ | ---------------------------------------------------- |
| To create a data stream: | Data Cloud Admin OR Data Cloud Data Aware Specialist |

Before you begin:

  • Make sure that the Amazon Redshift connection is set up.
  • Review IP addresses to make sure that the Redshift connection has the necessary access.

  1. In Data Cloud, on the Data Streams tab, click New.

    You can also use App Launcher to find and select Data Streams.

  2. Under Other Sources, select the Amazon Redshift connection source, and click Next.

  3. Select from the available Amazon Redshift connections.

  4. Select the database that you want to use.

  5. From the Available Objects, select the objects that you want to include, and click Next.

  6. Select the category to specify the type of data to ingest.

  7. For Primary Key, select a unique field to identify a record.

  8. (Optional) Select a record modified field.

    If data is received out of order, the record modified field provides a reference point to determine whether to update the record. The record with the most up-to-date timestamp is loaded.

  9. (Optional) For Organization Unit Identifier, select a business unit to use in a record’s data lineage.

  10. Select the source fields, and click Next.

    Fields with convertible data types are listed under Supported Fields.

  11. For Data Space, if the default data space isn’t selected, assign the data stream to the appropriate data space.

  12. If you want to query the data in your Amazon Redshift database with reduced latency, select Enable acceleration and choose the acceleration schedule.

  13. Click Deploy.
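
The way the primary key and the optional record modified field work together in steps 7 and 8 can be illustrated with a small sketch. This is not Data Cloud's implementation. It only models the described behavior: records are identified by a unique key, and when data arrives out of order, the version with the most recent timestamp is kept. The field names `id` and `last_modified`, the `apply_record` helper, and the choice to keep the existing record on a timestamp tie are all assumptions made for illustration.

```python
from datetime import datetime


def apply_record(store, record, key_field="id", modified_field="last_modified"):
    """Upsert a record keyed by its primary key.

    Hypothetical helper: the incoming record is loaded only if its
    record modified timestamp is newer than the stored version.
    Returns True if the record was loaded, False if it was skipped.
    """
    key = record[key_field]
    current = store.get(key)
    # Assumption: on equal timestamps, the stored record is kept.
    if current is not None and current[modified_field] >= record[modified_field]:
        return False
    store[key] = record
    return True


store = {}
incoming = [
    # The newer version arrives first...
    {"id": "A1", "status": "shipped", "last_modified": datetime(2024, 5, 2, 9, 0)},
    # ...and the older version arrives late, out of order.
    {"id": "A1", "status": "ordered", "last_modified": datetime(2024, 5, 1, 9, 0)},
]
results = [apply_record(store, record) for record in incoming]
```

In this sketch the late-arriving older record is skipped, so the store still holds the `shipped` version. Without a record modified field there is no such reference point, and the order of arrival alone would decide which version remains.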

You can now map your data lake object to the semantic data model to use the data in segments, calculated insights, and other use cases.