AI RESEARCH
Multilingual Language Models Encode Script Over Linguistic Structure
arXiv CS.LG
•
ArXi:2604.05090v1 Announce Type: cross Multilingual language models (LMs) organize representations for typologically and orthographically diverse languages into a shared parameter space, yet the nature of this internal organization remains elusive. In this work, we investigate which linguistic properties - abstract language identity or surface-form cues - shape multilingual representations.