AI RESEARCH

Emotion Concepts and their Function in a Large Language Model

arXiv CS.CL

ArXi:2604.07729v1 Announce Type: cross Large language models (LLMs) sometimes appear to exhibit emotional reactions. We investigate why this is the case in Claude Sonnet 4.5 and explore implications for alignment-relevant behavior. We find internal representations of emotion concepts, which encode the broad concept of a particular emotion and generalize across contexts and behaviors it might be linked to.