AI RESEARCH

IDIOLEX: Unified and Continuous Representations for Idiolectal and Stylistic Variation

arXiv cs.CL

arXiv:2604.04704v1 Announce Type: new

Existing sentence representations primarily encode what a sentence says rather than how it is expressed, even though the latter is important for many applications. In contrast, we develop sentence representations that capture style and dialect, decoupled from semantic content. We call this the task of idiolectal representation learning. We