Arena

Arena

Benchmark and compare the best AI models

456 followers

Arena is an open platform to evaluate, benchmark, compare, and test frontier AI models.
This is the 2nd launch from Arena. View more

Arena Agent Mode

Launching today
Get real-world tasks done with autonomous AI agents
Most AI benchmarks test models in controlled environments. Agent Mode tests them on complex tasks to get more work done. Run autonomous agents that browse, research, code, use files, and complete multi-step workflows from a single prompt. Then watch each workflow unfold step by step. Every run contributes to the Agent Arena Leaderboard, ranking frontier models by real-world agentic performance.
Arena Agent Mode gallery image
Arena Agent Mode gallery image
Arena Agent Mode gallery image
Arena Agent Mode gallery image
Arena Agent Mode gallery image
Arena Agent Mode gallery image
Arena Agent Mode gallery image
Arena Agent Mode gallery image
Arena Agent Mode gallery image
Free
Launch Team