Chaos

Chaos

Make clean datasets dirty!

7 followers

Most publicly available datasets are pre-cleaned, making it difficult to practice data cleaning and management skills with authentic, raw data. To bridge this gap, we developed Chaos—a web application that generates messy datasets from clean data.
Chaos gallery image
Chaos gallery image
Chaos gallery image
Chaos gallery image
Free
Launch Team / Built With

What do you think? …

Ridwan Adejumo Suleiman
Real-world data is often messy and complex, quickly becoming overwhelming for individuals seeking to improve their data cleaning and management skills. Accessing authentic, messy datasets can be challenging, as most datasets available on platforms like Kaggle and other repositories are pre-cleaned, making them far removed from the realities of working with raw data. To address this gap, we developed Chaos—a web application designed to generate messy datasets from clean data. Inspired by Nicola Rennie's brilliant work in the messy R package, this tool is ideal for data scientists, educators, and developers who want to stress-test their data pipelines or teach data cleaning in a controlled environment.