AI RESEARCH

DeliberationBench: A Normative Benchmark for the Influence of Large Language Models on Users' Views

arXiv CS.AI

ArXi:2603.10018v1 Announce Type: cross As large language models (LLMs) become pervasive as assistants and thought partners, it is important to characterize their persuasive influence on users' beliefs. However, a central challenge is to distinguish "beneficial" from "harmful" forms of influence, in a manner that is normatively defensible and legitimate. We propose DeliberationBench, a benchmark for assessing LLM influence that takes the process of deliberative opinion polling as its standard. We nstrate our approach in a preregistered randomized experiment in which 4,088 U. S.