AI RESEARCH

CWoMP: Morpheme Representation Learning for Interlinear Glossing

arXiv cs.CL

arXiv:2603.18184v1 (Announce Type: new)

Interlinear glossed text (IGT) is a standard notation for language documentation that is linguistically rich but laborious to produce manually. Recent automated IGT methods treat glosses as character sequences, neglecting their compositional structure. We propose CWoMP (Contrastive Word-Morpheme Pre