matthew david

data-diff
matthew david

3yr ago

data-diff - Compare tables of any size across databases

Data Diff is an open-source package that runs from the command line or can be wrapped in a data orchestrator such as Airflow or Dagster. It compares datasets across different databases quickly (in seconds to minutes), even at large scale (millions to billions of rows).

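To make the idea of diffing tables across two databases concrete, here is a minimal sketch using only Python's standard library. It is not data-diff's API or implementation: the function names (`make_db`, `segment_rows`, `checksum`, `diff_segment`), the `items` table, and its `id`/`value` columns are all hypothetical, and the checksum-then-split strategy is just one common way to avoid comparing every row when most of the data matches.

```python
import hashlib
import sqlite3


def make_db(rows):
    """Create an in-memory SQLite database with a hypothetical `items` table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE items (id INTEGER PRIMARY KEY, value TEXT)")
    conn.executemany("INSERT INTO items VALUES (?, ?)", rows)
    return conn


def segment_rows(conn, table, lo, hi):
    """Return the rows with lo <= id < hi, ordered by id."""
    cur = conn.execute(
        f"SELECT id, value FROM {table} WHERE id >= ? AND id < ? ORDER BY id",
        (lo, hi),
    )
    return cur.fetchall()


def checksum(rows):
    """Hash a list of rows. SQLite has no built-in hash function, so this
    sketch hashes client-side; a real tool would likely do it in the database."""
    h = hashlib.md5()
    for row in rows:
        h.update(repr(row).encode())
    return h.hexdigest()


def diff_segment(conn_a, conn_b, table, lo, hi, threshold=4):
    """Diff the key range [lo, hi): skip it if checksums match, otherwise
    split it in half until the range is small enough to compare row by row."""
    rows_a = segment_rows(conn_a, table, lo, hi)
    rows_b = segment_rows(conn_b, table, lo, hi)
    if checksum(rows_a) == checksum(rows_b):
        return []
    if hi - lo <= threshold:
        set_a, set_b = set(rows_a), set(rows_b)
        changes = [("-", r) for r in set_a - set_b]
        changes += [("+", r) for r in set_b - set_a]
        # Order by id, listing the removed row before the added one.
        return sorted(changes, key=lambda d: (d[1][0], d[0] == "+"))
    mid = (lo + hi) // 2
    return (
        diff_segment(conn_a, conn_b, table, lo, mid, threshold)
        + diff_segment(conn_a, conn_b, table, mid, hi, threshold)
    )


# Two databases that differ in one changed row (id 5) and one extra row (id 9).
db_a = make_db([(i, f"v{i}") for i in range(1, 9)])
db_b = make_db(
    [(i, "changed" if i == 5 else f"v{i}") for i in range(1, 9)] + [(9, "v9")]
)

result = diff_segment(db_a, db_b, "items", 1, 10)
for sign, row in result:
    print(sign, row)
```

Rows prefixed with `-` exist only in the first database and rows prefixed with `+` exist only in the second, so a changed row shows up as a `-`/`+` pair with the same id.
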
matthew david

3yr ago

data-diff - Efficiently diff data in or across relational databases

Open-source data-diff keeps getting better! 💫 In our latest release: ⏱ Faster diffing 🦆 DuckDB support! ✨ Store diff results ➕ and more! Check out the full release notes here: https://github.com/datafold/data...