Create an Amazon Redshift Data Stream
Create a data stream to start the flow of data from your Amazon Redshift source to Data Cloud.
| User Permissions Needed | | ------------------------ | ---------------------------------------------------- | | To create a data stream: | Data Cloud Admin OR Data Cloud Data Aware Specialist |
Before you begin:
- Make sure that the Amazon Redshift connection is set up.
- Review IP addresses to make sure that the Redshift connection has the necessary access.
-
In Data Cloud, on the Data Streams tab, click New.
You can also use App Launcher to find and select Data Streams.
-
Under Other Sources, select the Amazon Redshift connection source, and click Next.
-
Select from the available Amazon Redshift connections.
-
Select the database that you want to use.
-
From the Available Objects, select the objects that you want to include, and click Next.
-
Select the category to specify the type of data to ingest.
-
For Primary Key, select a unique field to identify a record.
-
(Optional) Select a record modified field.
If data is received out of order, the record modified field provides a reference point to determine whether to update the record. The record with the most up-to-date timestamp is loaded.
-
(Optional) For Organization Unit Identifier, select a business unit to use in a record’s data lineage.
-
Select the source field and click Next.
Fields with convertible data types are listed under Supported Fields.
-
For Data Space, if the default data space isn’t selected, assign the data stream to the appropriate data space.
-
If you want to query the data in your Amazon Redshift database with reduced latency, select Enable acceleration and choose the acceleration schedule.
-
Click Deploy.
You can now map your data lake object to the semantic data model to use the data in segments, calculated insights, and other use cases.