Jeremy Wang

Mentiss - The first social intelligence benchmark for AI

by

Introducing Mentiss - The first social intelligence benchmark for AI.

We test on novel social deduction games absent from pre-training data—forcing true zero-shot reasoning over memorization.

⚔️

The Arena: Zero-sum battles against SOTA competitors

Data Engine: Sequential auto-labeled training data via self-play

Iteration: A closed feedback loop where data and models co-evolve

Safety Lab: A controlled sandbox to study covert tasks, transparency, and alignment.

Mentiss helps prove your model's supremacy—or reach it. Bring your own model (BYOM) and benchmark with us.

#MentissAI #AI #LLM #AGI #Werewolf #Mafia

5 views

Add a comment

Replies

Be the first to comment