Niels Bantilan

Pandera - Protect your data and ML products from low-quality data

Pandera is an open-source framework for data testing, built for data scientists and ML engineers. It provides a simple, zero-configuration Python library that helps you write complex schemas with ease and validate data at critical points of your pipeline in production.

Replies

Niels Bantilan
Hi everyone, I’m the Chief ML Engineer at Union.ai, and I’m super excited to share the launch of Pandera on Product Hunt 🎉 I’ve been working with data and building models for a decade, and one of my biggest pain points has been working with low-quality data ❌. I got burned by incorrect data types and unexpected values so many times that I built Pandera to help you safeguard your pipelines from silent data bugs 🐞. Pandera provides a simple, zero-configuration interface for defining schemas for your dataframe-like objects, so you can validate not only your data but also your data transformation functions ✅. Give it a try here! https://pandera.readthedocs.io/e...
Samhita Alla
Pandera is an excellent data validation tool; love it!
sumanth reddy
Excited for this launch, and congratulations @niels_bantilan!