AI RESEARCH
12 Angry AI Agents: Evaluating Multi-Agent LLM Decision-Making Through Cinematic Jury Deliberation
arXiv CS.AI
arXiv:2605.01986v1 Announce Type: new What if the twelve jurors of Sidney Lumet's 12 Angry Men were not men, but large language models? Would the lone dissenting juror still be able to change everyone's mind? This paper instantiates that scenario as a multi-agent benchmark for LLM deliberation: twelve agents, each conditioned on a film-faithful persona, debate the film's murder case using a multi-agent framework.
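To make the setup concrete, here is a minimal sketch of what a twelve-agent deliberation loop could look like. It is not the paper's implementation: the abstract does not name the framework, the prompts, or the voting protocol, so every identifier below (`Juror`, `stub_policy`, `deliberate`, `build_jury`, the `stubbornness` knob) is a hypothetical stand-in. In particular, `stub_policy` is a deterministic toy rule used in place of a real LLM call so the example runs offline. The only detail taken from the film is that Juror 8 is the initial dissenter.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

GUILTY, NOT_GUILTY = "guilty", "not guilty"


@dataclass
class Juror:
    number: int
    persona: str         # stands in for a film-faithful persona prompt
    vote: str
    stubbornness: float  # hypothetical knob in [0, 1]; not from the paper


# A policy maps (juror index, juror, current votes of all jurors) to a new vote.
Policy = Callable[[int, Juror, List[str]], str]


def stub_policy(idx: int, juror: Juror, votes: List[str]) -> str:
    """Toy replacement for an LLM call (an assumption, not the paper's method).

    A juror who already votes not guilty keeps that vote. A juror voting
    guilty switches once the share of peers voting not guilty exceeds
    the juror's stubbornness.
    """
    if juror.vote == NOT_GUILTY:
        return NOT_GUILTY
    peers = votes[:idx] + votes[idx + 1:]
    share_not_guilty = peers.count(NOT_GUILTY) / len(peers)
    return NOT_GUILTY if share_not_guilty > juror.stubbornness else GUILTY


def deliberate(
    jurors: List[Juror], policy: Policy, max_rounds: int = 20
) -> Tuple[Optional[str], int, List[List[str]]]:
    """Run simultaneous voting rounds until unanimity or max_rounds.

    Returns (verdict or None for a hung jury, rounds used, vote history).
    """
    history = [[j.vote for j in jurors]]
    for round_no in range(1, max_rounds + 1):
        snapshot = history[-1]
        # Every juror reacts to the same snapshot of the previous round.
        new_votes = [policy(i, j, snapshot) for i, j in enumerate(jurors)]
        for juror, vote in zip(jurors, new_votes):
            juror.vote = vote
        history.append(new_votes)
        if len(set(new_votes)) == 1:
            return new_votes[0], round_no, history
    return None, max_rounds, history


def build_jury() -> List[Juror]:
    """Eleven guilty votes and one dissenter (Juror 8, as in the film).

    Stubbornness values are illustrative: they are spaced so that each
    round persuades exactly one more juror under stub_policy.
    """
    jurors = []
    rank = 0  # position among the eleven non-dissenting jurors
    for n in range(1, 13):
        if n == 8:
            jurors.append(Juror(n, f"Juror {n}", NOT_GUILTY, 1.0))
        else:
            jurors.append(Juror(n, f"Juror {n}", GUILTY, (rank + 0.5) / 11))
            rank += 1
    return jurors


if __name__ == "__main__":
    verdict, rounds, history = deliberate(build_jury(), stub_policy)
    for i, votes in enumerate(history):
        print(f"round {i}: {votes.count(NOT_GUILTY)} not guilty")
    print("verdict:", verdict, "after", rounds, "rounds")
```

In an actual benchmark, the policy would be replaced by a call to a persona-conditioned model that also sees the debate transcript; the loop structure and the unanimity check are the parts this sketch is meant to illustrate.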