Data Diff is an open-source package that can be run in a CLI or wrapped into any data orchestrator such as Airflow, Dagster, etc. Compare datasets quickly (seconds/minutes) at a large (millions/billions of rows) scale across different databases.
Recent launches
data-diff
Open source data-diff keeps getting better! 💫
In our latest release:
⏱ Faster diffing
🦆 DuckDB support!
✨ Store diff results
➕ and more!
Check out the full release notes here:
https://github.com/datafold/data-diff/releases/tag/v0.3.0
data-diff
Data Diff is an open-source package that can be run in a CLI or wrapped into any data orchestrator such as Airflow, Dagster, etc. Compare datasets quickly (seconds/minutes) at a large (millions/billions of rows) scale across different databases.