YoNER: A New Yorùbá Multi-domain Named Entity Recognition Dataset
arXiv CS.CL
arXiv:2604.05624v1. Named Entity Recognition (NER) is a foundational NLP task, yet research on Yorùbá has been constrained by limited and domain-specific resources. Existing resources, such as MasakhaNER (a manually annotated news-domain corpus) and WikiAnn (automatically created from Wikipedia), are valuable but restricted in domain coverage. To address this gap, we present YoNER, a new multi-domain Yorùbá NER dataset that extends entity coverage beyond news and Wikipedia.