Starburst

Info: Column-level Lineage is not currently supported for Starburst.

Steps to complete:

  1. Configure user in Starburst
  2. Create schema for Datafold
  3. Configure your data connection in Datafold

Configure user in Starburst

To connect to Starburst, create a user with read-only access to every data source you want to diff, and optionally generate an access token for that user. Datafold also requires a schema within one of your catalogs, typically one hosted on Amazon S3 or a similar service.

Create schema for Datafold

Datafold uses a temporary schema to materialize scratch work and keep data processing in your warehouse.
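
As a rough sketch of this step, the schema can be created with a Trino-style `CREATE SCHEMA` statement run by a user who is allowed to create schemas in the chosen catalog. The helper below only builds the SQL string and does not connect to Starburst. The function name `build_create_schema_sql`, the catalog `datafold_catalog`, the schema `datafold_tmp`, and the S3 path are hypothetical placeholders, and the `location` property is an assumption that only applies to object-storage catalogs (for example Hive or Iceberg).

```python
import re
from typing import Optional

# Conservative identifier check (letters, digits, underscores) so the
# names can be placed in the statement without quoting.
_IDENT = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def build_create_schema_sql(catalog: str, schema: str,
                            location: Optional[str] = None) -> str:
    """Build a CREATE SCHEMA statement for Datafold's temporary tables.

    Hypothetical helper for illustration; not part of Datafold or Starburst.
    """
    for name in (catalog, schema):
        if not _IDENT.fullmatch(name):
            raise ValueError(f"unsupported identifier: {name!r}")
    sql = f"CREATE SCHEMA IF NOT EXISTS {catalog}.{schema}"
    if location is not None:
        # Assumption: the catalog accepts a `location` schema property.
        escaped = location.replace("'", "''")
        sql += f" WITH (location = '{escaped}')"
    return sql


if __name__ == "__main__":
    # Placeholder names; replace with your own catalog, schema, and bucket.
    print(build_create_schema_sql(
        "datafold_catalog", "datafold_tmp", "s3://example-bucket/datafold_tmp/"))
```

The resulting `<catalog>.<schema>` pair is the value to enter later as the schema for temporary tables.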

Configure in Datafold

| Field Name | Description |
| --- | --- |
| Connection name | A name given to the data connection within Datafold. |
| Host | The hostname for your Starburst instance (e.g., sample-free-cluster.trino.galaxy.starburst.io for Starburst SaaS). |
| Port | Starburst endpoint port; default value is 443. |
| Encryption | Should be checked for Starburst Galaxy, possibly unchecked for local deployments. |
| User ID | User ID as created in Starburst, typically an email address. |
| Token | Access token generated in Starburst. |
| Password | Alternatively, provide a password. |
| Schema for temporary tables | Use <catalog>.<schema> format. |
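
Before filling in the form, it can help to sanity-check the values offline. The sketch below is not a Datafold API: the `StarburstConnection` class and the `validate` and `split_temp_schema` functions are hypothetical names that simply mirror the fields above. It assumes that the host is a bare hostname without a URL scheme, that at least one of token or password is supplied, and that 443 (the standard HTTPS port) is a sensible default port.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class StarburstConnection:
    """Hypothetical container mirroring the form fields; not a Datafold API."""
    name: str
    host: str
    user_id: str
    temp_schema: str              # expected as <catalog>.<schema>
    port: int = 443               # assumption: standard HTTPS port
    encryption: bool = True
    token: Optional[str] = None
    password: Optional[str] = None


def split_temp_schema(value: str) -> Tuple[str, str]:
    """Split '<catalog>.<schema>' into its two parts, or raise ValueError."""
    parts = [p.strip() for p in value.split(".")]
    if len(parts) != 2 or not all(parts):
        raise ValueError(f"expected <catalog>.<schema>, got {value!r}")
    return parts[0], parts[1]


def validate(conn: StarburstConnection) -> List[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    if not conn.host or "://" in conn.host:
        problems.append("host should be a bare hostname without a URL scheme")
    if not 1 <= conn.port <= 65535:
        problems.append("port must be between 1 and 65535")
    if not (conn.token or conn.password):
        problems.append("provide either a token or a password")
    try:
        split_temp_schema(conn.temp_schema)
    except ValueError as exc:
        problems.append(str(exc))
    return problems


if __name__ == "__main__":
    conn = StarburstConnection(
        name="starburst-prod",                                   # placeholder
        host="sample-free-cluster.trino.galaxy.starburst.io",
        user_id="datafold@example.com",                          # placeholder
        temp_schema="datafold_catalog.datafold_tmp",             # placeholder
        token="example-token",                                   # placeholder
    )
    print(validate(conn))
```

These checks only catch formatting mistakes; they do not confirm that the credentials or the schema actually exist in Starburst.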

Click Create. Your data source is now ready!