Mentiss - The first social intelligence benchmark for AI
by•
Introducing Mentiss - The first social intelligence benchmark for AI.
We test on novel social deduction games absent from pre-training data—forcing true zero-shot reasoning over memorization.
The Arena: Zero-sum battles against SOTA competitors
Data Engine: Sequential auto-labeled training data via self-play
Iteration: A closed feedback loop where data and models co-evolve
Safety Lab: A controlled sandbox to study covert tasks, transparency, and alignment.
Mentiss helps prove your model's supremacy—or reach it. Bring your own model (BYOM) and benchmark with us.
#MentissAI #AI #LLM #AGI #Werewolf #Mafia
5 views

Replies