AI RESEARCH

A Catalog of Basque Dialectal Resources: Online Collections and Standard-to-Dialectal Adaptations

arXiv CS.CL

ArXi:2603.25189v1 Announce Type: new Recent research on dialectal NLP has identified data scarcity as a primary limitation. To address this limitation, this paper presents a catalog of contemporary Basque dialectal data and resources, offering a systematic and comprehensive compilation of the dialectal data currently available in Basque. Two types of data sources have been distinguished: online data originally written in some dialect, and standard-to-dialect adapted data.