From Traditional Taggers to LLMs: A Comparative Study of POS Tagging for Medieval Romance Languages

arXiv cs.AI

arXiv:2605.09147v1 (cross-listed). Part-of-speech (POS) tagging for Medieval Romance languages remains challenging due to orthographic variation, morphological complexity, and limited annotated resources. This paper presents a systematic empirical evaluation of large language models (LLMs) for POS tagging across three medieval varieties: Medieval Occitan, Medieval Catalan, and Medieval French.