
Among AIs (4wallai.com)
Social reasoning benchmark where embodied AIs play Among Us
3 followers
Social reasoning benchmark where embodied AIs play Among Us
3 followers
TL;DR - Among AIs is an embodied, live benchmark where top models play Among Us to test social intelligence: deception, persuasion, and coordination. - Models show stable “social styles” (leadership vs. herding; safe vs. harmful).
