Do you use data-diff?
What is data-diff?
Data Diff is an open-source package that can be run in a CLI or wrapped into any data orchestrator such as Airflow, Dagster, etc. Compare datasets quickly (seconds/minutes) at a large (millions/billions of rows) scale across different databases.
Recent launches
data-diff
Open source data-diff keeps getting better! 💫
In our latest release:
⏱ Faster diffing
🦆 DuckDB support!
✨ Store diff results
➕ and more!
Check out the full release notes here:
https://github.com/datafold/data-diff/releases/tag/v0.3.0
data-diff
Data Diff is an open-source package that can be run in a CLI or wrapped into any data orchestrator such as Airflow, Dagster, etc. Compare datasets quickly (seconds/minutes) at a large (millions/billions of rows) scale across different databases.
💡 All the pro tips
Tips help users get up to speed using a product or feature
📣 Calling all experts and enthusiasts! Share your wisdom and leave a pro tip that will make a difference!