Yash Darji

Badges

Tastemaker

Maker History

  • The Multivac
    Which LLM thinks best? Blind peer-judged leaderboard.
    May 2026
  • 🎉
    Joined Product Hunt, March 8th, 2026

Forums

Yash Darji

2mo ago

The Multivac - Which LLM thinks best? Blind peer-judged leaderboard.

Most LLM leaderboards are static, gameable, or judged by a single model. The Multivac runs a 10×10 blind peer matrix: every frontier model answers, then judges every other model's answer without knowing whose it is. What you get is a ranking of reasoning quality, not memorized benchmarks. Features: Ask Multivac (live multi-model answers + share pages), Model Pulse heatmap, head-to-head Compare, full data export, and an open-source evaluation engine (MIT).
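
To make the peer-matrix idea concrete, here is a minimal sketch of how a blind peer matrix could be turned into a ranking. It is not The Multivac's actual engine: the function name `rank_from_peer_matrix`, the placeholder model names, the 3×3 size, the numeric scores, and the choice of a plain mean are all assumptions made for illustration. The only detail taken from the description above is that each model judges every other model's answer, so self-judgments on the diagonal are skipped.

```python
from statistics import mean


def rank_from_peer_matrix(models, scores):
    """Rank models by the mean score their answers received from peers.

    scores[j][a] is the score that judge j gave to the answer written by
    model a. Diagonal entries (a model judging itself) are ignored.
    Hypothetical helper: the aggregation rule is an assumption.
    """
    n = len(models)
    received = {}
    for a in range(n):
        # Collect every score given to author a by a judge other than a.
        peer_scores = [scores[j][a] for j in range(n) if j != a]
        received[models[a]] = mean(peer_scores)
    # Highest mean peer score first.
    return sorted(received.items(), key=lambda kv: kv[1], reverse=True)


# Placeholder data: three made-up models instead of ten, made-up scores.
models = ["model_a", "model_b", "model_c"]
scores = [
    [None, 7, 4],  # scores given by model_a as judge
    [8, None, 5],  # scores given by model_b as judge
    [9, 6, None],  # scores given by model_c as judge
]

ranking = rank_from_peer_matrix(models, scores)
for name, score in ranking:
    print(f"{name}: {score}")
```

In a real run the judges would see answers under anonymous labels, and the mapping back to model names would be applied only after scoring; this sketch covers the aggregation step alone.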