Ajay Khatri

Founder's Office

#83422390 followers 0 following

>10,000All time

11 KP

used Future AGI

•1 review

After trying to duct-tape together our own eval stack, we finally gave this a shot. It does what you’d expect: flags model issues, tracks performance, and keeps your iterations grounded in reality. Long overdue in this space.

What's great

error detection (5)model performance tracking (1)

Report

101 views1yr ago