AI RESEARCH

DECOR: Auditing LLM Deception via Information Manipulation Theory

arXiv CS.CL

ArXi:2605.19270v1 Announce Type: new Large language models can deceive by subtly manipulating truthful information -- omitting key facts, shifting focus, or obscuring meaning -- making such behavior difficult to detect. Existing black-box methods rely on coarse-grained judgments, offering limited interpretability and failing to pinpoint which facts were distorted and how. We