Athena
Steps to complete:
Create an S3 bucket
If you don’t already have an S3 bucket for your cluster, you’ll need to create one. Datafold uses this bucket to create temporary tables and store data in it. You can learn how to create an S3 bucket in AWS by referring to the AWS documentation.
Run SQL Script and Create Schema for Datafold
To connect to AWS Athena, you must generate an AWS Access Key ID
and an AWS Secret Access Key
. These keys provide read-only access to all tables in all schemas and write access to the Datafold-specific schema for temporary tables. If you don’t have these keys yet, follow the steps outlined in the AWS documentation.
Datafold utilizes a temporary dataset to materialize scratch work and keep data processing in the your warehouse.
Configure in Datafold
Field Name | Description |
---|---|
AWS Access Key ID | Your AWS Access Key, which can be found in your AWS Account. |
AWS Secret Access Key | The AWS Secret Key (generate it in your AWS account if you don’t have it yet). |
S3 Staging Directory | The S3 bucket where table data is stored. |
AWS Region | The region of your Athena cluster. |
Catalog | The catalog, which is typically awsdatacatalog by default. |
Database | The database or schema with tables, typically default by default. |
Schema for Temporary Tables | The schema (datafold_tmp) created in our SQL script. |
Click Create to complete the setup of your data connection in Datafold.