Evaluating Adjective-Noun Compositionality in LLMs: Functional vs Representational Perspectives

arXiv:2603.09994v1 Announce Type: cross

Compositionality is considered central to language abilities. Given that large language models (LLMs) are performant language systems, how well do they handle compositional tasks? We evaluate adjective-noun compositionality in LLMs using two complementary setups: a prompt-based functional assessment and a representational analysis of internal model states. Our results reveal a striking divergence between task performance and internal states.