Ajay Khatri

Ajay Khatri

Founder's Office
Future AGI
•1 review
After trying to duct-tape together our own eval stack, we finally gave this a shot. It does what you’d expect: flags model issues, tracks performance, and keeps your iterations grounded in reality. Long overdue in this space.

What's great

error detection (5)model performance tracking (1)
1 view