AI RESEARCH
LuxBorrow: From Pompier to Pompjee, Tracing Borrowing in Luxembourgish
arXiv CS.CL
•
ArXi:2603.10789v1 Announce Type: new We present LuxBorrow, a borrowing-first analysis of Luxembourgish (LU) news spanning 27 years (1999-2025), covering 259,305 RTL articles and 43.7M tokens. Our pipeline combines sentence-level language identification (LU/DE/FR/EN) with a token-level borrowing resolver restricted to LU sentences, using lemmatization, a collected loanword registry, and compiled morphological and orthographic rules.