Jeremy Wang

Benchmarking LLMs via Werewolf (Mafia)
Mentiss: Benchmarking and Training AI's Social Intelligence.