Mind the Gap: How Elicitation Protocols Shape the Stated-Revealed Preference Gap in Language Models

arXiv:2601.21975v2 Announce Type: replace Recent work identifies a stated-revealed (SvR) preference gap in language models (LMs): a mismatch between the values models endorse and the choices they make in context. Existing evaluations rely heavily on binary forced-choice prompting, which entangles genuine preferences with artifacts of the elicitation protocol. We systematically study how elicitation protocols affect SvR correlation across 24 LMs.
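
To make the notion of SvR correlation concrete, the following is a minimal sketch under stated assumptions, not the paper's method. It assumes each value has a stated score (how strongly a model endorses it) and a revealed score (how often the model's in-context choices favor it), and it uses a Pearson correlation as the agreement measure. The abstract does not specify the statistic, and the function name and all numbers below are hypothetical illustrations.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


# Hypothetical per-value scores for one model under one elicitation protocol.
# stated: endorsement strength; revealed: fraction of in-context choices
# that favor the value. These numbers are made up for illustration.
stated = [0.9, 0.7, 0.4, 0.2]
revealed = [0.8, 0.5, 0.5, 0.3]

svr_correlation = pearson(stated, revealed)
print(f"SvR correlation: {svr_correlation:.3f}")
```

Repeating this computation per protocol (for example, binary forced choice versus an alternative format) would give one correlation per protocol and model, which is the kind of quantity a protocol comparison could be built on.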