TurkicNLP: An NLP Toolkit for Turkic Languages
arXiv cs.CL
arXiv:2602.19174v3. Natural language processing for the Turkic language family, spoken by over 200 million people across Eurasia, remains fragmented: most languages lack unified tooling and resources. We present TurkicNLP, an open-source Python library that provides a single, consistent NLP pipeline for Turkic languages across four script families: Latin, Cyrillic, Perso-Arabic, and Old Turkic Runic.
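A pipeline spanning several scripts typically needs to decide which script an input is written in before it can pick script-specific processing. The following is a minimal sketch of that one step, not TurkicNLP's actual API: the function `detect_script` and the table `SCRIPT_PREFIXES` are hypothetical names introduced here for illustration, and the approach (majority vote over Unicode character names from Python's standard `unicodedata` module) is an assumption rather than the library's documented method.

```python
import unicodedata
from collections import Counter

# Hypothetical table (not from TurkicNLP): Unicode character-name prefixes
# mapped to the four script families named in the abstract.
SCRIPT_PREFIXES = {
    "LATIN": "Latin",
    "CYRILLIC": "Cyrillic",
    "ARABIC": "Perso-Arabic",
    "OLD TURKIC": "Old Turkic Runic",
}


def detect_script(text: str) -> str:
    """Return the script family that most alphabetic characters belong to.

    Returns "Unknown" when no character matches any of the four families.
    """
    counts = Counter()
    for ch in text:
        # Skip digits, punctuation, and whitespace; they are script-neutral.
        if not ch.isalpha():
            continue
        # unicodedata.name returns e.g. "CYRILLIC SMALL LETTER SCHWA".
        name = unicodedata.name(ch, "")
        for prefix, family in SCRIPT_PREFIXES.items():
            if name.startswith(prefix):
                counts[family] += 1
                break
    if not counts:
        return "Unknown"
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    samples = [
        "Merhaba dünya",            # Turkish, Latin script
        "Сәлем әлем",               # Kazakh, Cyrillic script
        "ياخشىمۇسىز",               # Uyghur, Perso-Arabic script
        "\U00010C00\U00010C01",     # Old Turkic (Orkhon) letters
    ]
    for sample in samples:
        print(detect_script(sample))
```

A majority vote keeps the sketch robust to mixed input such as a Cyrillic sentence containing a Latin-script brand name; a production system would likely also need language identification, since several Turkic languages share a script.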