INFO

Column-level Lineage is not currently supported for Dremio.

INFO

Schemas for tables in external data sources must be specified with quotes, e.g. “Postgres prod.analytics.sales”.
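As a rough illustration of the quoting rule, the sketch below builds a dotted path and wraps in double quotes any segment that is not a plain identifier (for example, a source name containing a space). The helper name `quote_path` is hypothetical, and per-segment quoting follows general Dremio SQL identifier conventions; that is an assumption, so check which exact form Datafold accepts for your source before relying on it.

```python
import re

# Segments made only of letters, digits, and underscores (not starting with
# a digit) are left bare; anything else is wrapped in double quotes.
_PLAIN = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def quote_path(parts):
    """Join path segments with dots, double-quoting segments that need it.

    Hypothetical helper for illustration; not part of Datafold or Dremio.
    """
    quoted = []
    for part in parts:
        if _PLAIN.fullmatch(part):
            quoted.append(part)
        else:
            # Escape embedded double quotes by doubling them.
            quoted.append('"' + part.replace('"', '""') + '"')
    return ".".join(quoted)


print(quote_path(["Postgres prod", "analytics", "sales"]))
# prints "Postgres prod".analytics.sales
```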

Steps to complete:

  1. Configure user in Dremio
  2. Create schema for Datafold
  3. Configure your data connection in Datafold

Configure user in Dremio

To connect to Dremio, create a user with read-only access to all data sources you wish to diff, then generate an access token for that user.

Temporary tables will be created in the $scratch schema, which doesn’t require special permissions.

Create schema for Datafold

Datafold uses a temporary dataset to materialize scratch work and keep data processing in your warehouse.

Configure your data connection in Datafold

| Field Name | Description |
| --- | --- |
| Connection name | A name given to the data connection within Datafold. |
| Host | The hostname for your Dremio instance (data.dremio.cloud for Dremio SaaS). |
| Port | Dremio endpoint port; default value is 443. |
| Encryption | Should be checked for Dremio Cloud; may be unchecked for local deployments. |
| User ID | User ID as created in Dremio, typically an email address. |
| Project ID | Dremio Project UID. If left blank, the default project will be used. |
| Token | Access token generated in Dremio. |
| Password | Alternatively, provide a password instead of a token. |
| Schema for temporary views | A Dremio space for temporary views. |
| Schema for temporary tables | $scratch should suit most applications, or use “<Datasource>.<schema>” (with quotes) if you wish to create temporary tables in an external data source. |
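
Before filling in the form, it can help to collect the values in one place and sanity-check them. The sketch below is only an illustration: the `DremioConnectionSettings` class, its attribute names, and the sample values are hypothetical and are not part of any Datafold or Dremio API. It checks the two rules stated in the field descriptions: either a token or a password is provided, and encryption is enabled for Dremio Cloud hosts.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DremioConnectionSettings:
    """Hypothetical container mirroring the form fields described above."""

    connection_name: str
    host: str
    port: int
    user_id: str
    encryption: bool = True
    project_id: Optional[str] = None  # blank means the default project
    token: Optional[str] = None
    password: Optional[str] = None
    temp_view_schema: str = ""
    temp_table_schema: str = "$scratch"

    def problems(self) -> List[str]:
        """Return readable issues; an empty list means nothing was flagged."""
        issues = []
        if not self.host:
            issues.append("host is empty")
        if not 1 <= self.port <= 65535:
            issues.append("port is out of range")
        if not (self.token or self.password):
            issues.append("provide a token or a password")
        if self.host.endswith("dremio.cloud") and not self.encryption:
            issues.append("encryption should be enabled for Dremio Cloud")
        return issues


settings = DremioConnectionSettings(
    connection_name="dremio-prod",       # placeholder name
    host="data.dremio.cloud",
    port=443,                            # standard HTTPS port; adjust for your deployment
    user_id="analyst@example.com",       # placeholder user
    token="<access token>",              # placeholder, never commit real tokens
    temp_view_schema="datafold_tmp",     # placeholder Dremio space
)
print(settings.problems())  # prints []
```

Keeping the token outside source control (for example, in an environment variable) is a sensible default when adapting a sketch like this.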

Click Create. Your data connection is now ready!