data-diff

Compare tables of any size across databases
What is data-diff?
data-diff is an open-source package that can be run from the command line or wrapped into a data orchestrator such as Airflow or Dagster. It compares datasets quickly (in seconds or minutes) at large scale (millions or billions of rows) across different databases.

Recent launches

data-diff
Open source data-diff keeps getting better! 💫 In our latest release: ⏱ Faster diffing 🦆 DuckDB support! ✨ Store diff results ➕ and more! Check out the full release notes here: https://github.com/datafold/data-diff/releases/tag/v0.3.0
