Data Connections
Redshift
Steps to complete:
Run SQL script and create schema for Datafold
To connect to Amazon Redshift, you must create a user with the following permissions:
- Read-only access to all tables in all schemas
- Write access to a dedicated temporary schema for Datafold
- Access to SQL logs for lineage construction
Datafold uses a temporary schema to materialize scratch work and keep data processing in your warehouse. Create the schema with:
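The statement itself is not shown here. A minimal sketch, assuming the schema name datafold_tmp used in the configuration table on this page (the IF NOT EXISTS guard is an addition so the statement can be re-run safely):

```sql
-- Scratch schema for Datafold's temporary tables.
-- The name datafold_tmp matches the "Schema for temporary tables" field below.
CREATE SCHEMA IF NOT EXISTS datafold_tmp;
```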
Next, create the Datafold user. To grant read access to all schemas, the user must have superuser-level privileges in Redshift:
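The statement is not shown here either. A sketch under stated assumptions: the user name datafold comes from the configuration table on this page, the password is a placeholder to replace with your own, and the CREATEUSER option is what gives a Redshift user superuser privileges:

```sql
-- Create the Datafold user with superuser privileges (CREATEUSER).
-- The password is a placeholder: Redshift requires 8-64 characters with
-- at least one uppercase letter, one lowercase letter, and one digit.
CREATE USER datafold PASSWORD 'Replace-With-Str0ng-Password' CREATEUSER;
```

The password chosen here is the value entered later in the Password field of the data connection.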
Grant unrestricted access to system logs so Datafold can build column-level lineage:
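As a sketch, assuming the user is named datafold as above, the Redshift option that controls visibility of system tables and query logs is SYSLOG ACCESS:

```sql
-- Let the datafold user see rows generated by all users in system tables
-- and views, which the lineage step reads to reconstruct queries.
ALTER USER datafold SYSLOG ACCESS UNRESTRICTED;
```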
Datafold utilizes a temporary schema, named datafold_tmp in the above script, to materialize scratch work and keep data processing in your warehouse.
Configure in Datafold
Field Name | Description |
---|---|
Name | A name given to the data connection within Datafold |
Host | The hostname of your cluster. (In your AWS console, go to Redshift and select your cluster; the hostname is the endpoint listed at the top of the page.) |
Port | Redshift connection port; default value is 5439 |
User | The user created in our SQL script, named datafold |
Password | The password created in our SQL script |
Database Name | The name of the Redshift database you want to connect to |
Schema for temporary tables | The schema (datafold_tmp) created in our SQL script |
Click Create. Your data connection is ready!