Atlas
Independent Evals and Benchmarks for GenAI models
6 followers
Independent Evals and Benchmarks for GenAI models
6 followers
Atlas, by LayerLens, is a community resource intended to provide insights about the performance of the top AI models through evals on benchmarks such as MATH, HumanEval, and MMLU. We are data-first, and provide a full suite of analytics for our benchmarks.




Mukh.1
A clean and credible view of how AI models actually perform. Congrats on the launch. We just launched Mukh.1 too — AI agents that take care of the everyday stuff. Give it a look!