AI RESEARCH

GLeMM: A large-scale multilingual dataset for morphological research

arXiv CS.CL

ArXi:2604.12442v1 Announce Type: new In derivational morphology, what mechanisms govern the variation in form-meaning relations between words? The answers to this type of questions are typically based on intuition and on observations drawn from limited data, even when a wide range of languages is considered. Many of these studies are difficult to replicate and generalize. To address this issue, we present GLeMM, a new derivational resource designed for experimentation and data-driven description in morphology.