Datafold provides a full-cycle migration automation solution for data teams, which includes code translation and cross-database reconciliation.

How does DMA work?

Datafold performs complete SQL codebase translation and validation using an AI-powered architecture. This approach leverages a large language model (LLM) with a feedback loop optimized for achieving full parity between the migration source and target. DMA analyzes metadata, including schema, data types, and relationships, to ensure accuracy in translation.

Datafold provides a comprehensive report at the end of the migration. This report includes links to data diffs validating parity and highlighting any discrepancies at the dataset, column, and row levels between the source and target databases.

Why migrate with DMA?

Unlike traditional deterministic transpilers, DMA offers several distinct benefits:

  • Full parity between source and target: DMA ensures not just code that compiles, but code that delivers the same results in your new database, complete with explicit validation.
  • Flexible dialect handling: DMA can adapt to any arbitrary input/output dialect without requiring a full grammar definition, which is especially valuable for legacy systems.
  • Self-correction capabilities: The AI-driven DMA can account for and correct mistakes based on both compilation errors and data discrepancies.
  • Modernizing code structure: DMA can convert complex stored procedures into clean, modern formats such as dbt projects, following best practices.

Getting started with DMA

INFO

Please contact support@datafold.com if you’d like to learn more.

  1. Connect your source and target data sources to Datafold.
  2. Provide Datafold access to your codebase, typically by installing the Datafold GitHub/GitLab/ADO app or via system catalog access for stored procedures.

Once you connect your source and target systems and Datafold ingests the codebase, DMA’s translation process is supervised by the Datafold team. In most cases, no additional input is required from the customer.

The migration process timeline depends on the technologies, scale, and complexity of the migration. After setup, migrations typically take several days to several weeks.

Security

Datafold is SOC 2 Type II, GDPR, and HIPAA-compliant. We offer flexible deployment options, including in-VPC setups in AWS, GCP, or Azure. The LLM infrastructure is local, ensuring no data is exposed to external subprocessors beyond the cloud provider. For VPC deployments, data stays entirely within the customer’s private network.

FAQ

For more information, please see our extensive FAQ section.