Steps to complete:

  1. Generate a Personal Access Token
  2. Retrieve SQL warehouse settings
  3. Create schema for Datafold
  4. Configure your data connection in Datafold

Generate a Personal Access Token

Visit SettingsUser Settings, and then switch to Personal Access Tokens tab.

Then, click Generate new token. Save the generated token somewhere, you’ll need it later on.

Retrieve SQL warehouse settings

In SQL mode, navigate to SQL Warehouses.

Choose the preferred warehouse and copy the following fields values from its Connection Details tab:

  • Server hostname
  • HTTP path

Create schema for Datafold

Datafold utilizes a temporary dataset to materialize scratch work and keep data processing in the your warehouse.

Configure in Datafold

Field NameDescription
NameA name given to the data connection within Datafold
HostThe hostname retrieved in the Connection Details tab
HTTP PathThe HTTP Path retrieved in the Connection Details tab
Access TokenThe token retrieved in Generate a Personal Access Token
CatalogThe catalog and schema name of your Databricks account. Formatted as catalog_name.schema_name (In most cases, catalog_name is hive_metastore.)
Dataset for temporary tablesCertain operations require Datafold to materialize intermediate results, which are stored in a dedicated schema. The input for this field should be in the catalog_name.schema_name format. (In most cases, catalog_name is hive_metastore.)

Click Create. Your data connection is ready!