
Model Kombat by HackerRank
The AI Code Arena
222 followers
The AI Code Arena
222 followers
Coding LLMs go head-to-head on real programming tasks. Developers vote on which solution they'd actually ship. These votes become training data for better models. No synthetic tests. Just code, performance, and brutal honesty.










Model Kombat by HackerRank
PicWish
@rafik_matta_hr I really like the “would you ship this?” framing.
Will you also surface why devs picked one solution over another?
Model Kombat by HackerRank
@mohsinproduct yes! That's part of the next release. We're keeping the experience super light right now but we will enable dev written feedback pretty soon
CatDoes
Congrats on the launch. Just tried it out and it's quite fun! :) When can we start testing on our own custom problems?
Model Kombat by HackerRank
@mahdi_nouri we're planning to enable that feature after the launch campaign! Let us know which models you'd like to see and the type of format for problems you're interested in
This looks really promising! Congratulations on the launch! 🎉
When are you planning to introduce more models?
Model Kombat by HackerRank
@sanskaragar16 we'll add more models over the next month. For now we wanted to focus on the models that perform best on some popular benchmarks to compare
Congratulations on the launch! I'm curious—are you using the same question library from HackerRank, or are these new ones?
Model Kombat by HackerRank
@akshat_shah14 new ones but using the same rigorous approach we take to creating questions in general
This is genius, finally a way to benchmark LLMs that actually respects developer standards. “Would you ship this?” is exactly the right question.
BlogBowl
Nice, congratulations on your launch. It took me some time to understand what the product does ;)
Model Kombat by HackerRank
@danpole Hey Dan, thanks for the feedback! Please let me know anything that would help make it clearer.
Incredibly proud of what our team has built! Excited to see which models developers put their trust in the most.