Ritesh Malpani's profile on Product Hunt

All activity

13h ago

BenchSpan is a benchmarking platform for AI agents. Running benchmarks is slow, expensive, and fragile. We fix that. Onboard your agent once (we onboarded Claude Code in 37 lines), run any benchmark in parallel in the cloud, and get every result in one place your whole team can see. When runs fail halfway, rerun just what broke. Compare runs side by side to see exactly where your agent is improving. Stop fighting your benchmarks and start shipping your agent.

BenchspanRun agent benchmarks in minutes, not hours

Ritesh Malpanileft a comment

13h ago

Hey PH 👋, Ritesh from Benchspan here, We were building AI agents and needed to know if they were getting better. Sounds simple. It wasn't. Every benchmark assumed a different interface, days of glue code just to get running. Full suites took 14 hours on a laptop. A single failure at 72% burned $600 in tokens and we'd start from scratch. Nobody on the team trusted anyone else's numbers because...

BenchspanRun agent benchmarks in minutes, not hours