Looker
Create a Code Repositories Integration
Create a Code Repositories integration that connects Datafold to your Looker repository.
Create a Looker Integration
Navigate to Settings > Integrations > Data Apps and add a Looker Integration.
Complete the configuration by specifying the following fields:
Field Name | Description |
---|---|
Integration name | An identifier used in Datafold to identify this Data App configuration. |
Project Repository | Select the same repository as used in your Looker project. |
API Host URL | The Looker API Host URL. It has the following format: https://<instance_name>.cloud.looker.com:<port> . The port defaults are 19999 (legacy) and 443 (new), see the Looker Docs for hints. Examples: Legacy (https://datafold.cloud.looker.com:19999 ), New (https://datafold.cloud.looker.com:443 ) |
Client ID | Follow these steps to generate Client ID and Client Secret. These are always user specific. We recommend using a group email for continuity. See Looker User Minimum Access Policy for the required permissions. |
Client Secret | See Client ID |
Data source mapping | When the correct credentials are entered we will begin to populate data sources in Looker (on the left side) that will need to be mapped to data sources configured in Datafold (on the right side). See image below. |
When completed, click Submit.
It may take some time to sync all the Looker entities to Datafold and for Lineage to populate. When completed, your Looker assets will appear in Lineage as search results.
Tracking Jobs explains how to find out when your data app integration is ready.
Looker User Minimum Access Policy
The user linked to the API credentials needs the predefined Developer role or you can create a custom role with these permissions:
access_data
download_without_limit
explore
login_special_email
manage_spaces
see_drill_overlay
see_lookml
see_lookml_dashboards
see_looks
see_pdts
see_sql
see_user_dashboards
send_to_integration
Providing Context for Connecting Looker to Your Database and Schema
Using the Fully Qualified Names in your Looker view files is not always possible. There are 3 different ways Datafold can construct the Fully Qualified Names.
- Using the (default) database configured in the Data Source configuration in the Datafold Console.
- Providing it through the Looker API. Unfortunately, this requires the Looker user to have the Admin role attached.
- Adding User Attributes to the Looker user.
- Setting
datafold_default_database
to the database name you want to connect Looker Lineage to. - (BigQuery only) Setting
datafold_default_host
to the project id.
- Setting
Order of Precedence:
- The Fully Qualified names defined in your Looker view files (if any) take precedence over
- The User Attributes set for the Looker user, which take precedence over
- What we get from the Looker API, which takes precedence over
- The (default) database configured in the Data Source configuration.
Current Limitations of Datafold Looker Integration
Datafold lets you connect to Looker and extend our capabilities to your Looker Views, Explores, Looks, and Dashboards. But this is a new feature, so there are some things we don’t support yet:
- PDT/Derived Tables:Datafold only works with the tables that come from your data sources, but not with the tables that Looker makes from your SQL queries.
- Merge Queries: Datafold supports the Queries and Looks that make up your Dashboards, but Merge Queries are not one of them. For some use cases you could achieve the same by joining the underlying views with an explore.
- Usage metrics and popularity: Datafold shows you your Looker objects - such as dashboards, looks, and fields - but not how much you use or like them.
We are improving our Looker integration and adding more features soon. We welcome your feedback and suggestions.