Skip to main content

Key features

Use cases

Data Knowledge Graph

Private Beta — The Data Knowledge Graph is currently in private beta. Contact the Datafold team at sales@datafold.com to enable this for your organization.
The Data Knowledge Graph (DKG) automatically collects and unifies all information about your data ecosystem — lineage, business logic, usage statistics, BI connections, git history, and organizational knowledge — and serves it to your AI agents via MCP. Unlike data catalogs that rely on manual curation, the DKG is sourced and maintained by AI, and optimized for consumption by the coding agents of your choice. It spans all your data sources and code bases, creating a comprehensive view of your entire data platform that is inaccessible to any single provider on their own. The DKG powers Datafold’s specialized agents (such as the Data Migration Agent) and supercharges external coding agents (Claude Code, Cursor, Windsurf) by providing the context they need to produce reliable results for any data engineering task.

Getting started

There are a few ways to get started with your first data diff:
1

Create a data diff

Once you’ve integrated a data connection and code repository, you can run a new in-database or cross-database data diff or explore your data lineage.
2

Create automated monitors

Create monitors to send alerts when data diffs fall outside predefined ranges.
3

Set up CI/CD testing

Get started with deployment testing through our universal (No-Code, API) or dbt integrations.

Learn more