AI RESEARCH
In Data or Invisible: Toward a Better Digital Representation of Low-Resource Languages with Knowledge Graphs
arXiv CS.AI
•
ArXi:2605.05931v1 Announce Type: new Emerging digital technologies are exacerbating the existing divide in Open Access Data (OAD) between high-and low-resource languages, excluding many communities from participating in the global digital transformation. In this PhD proposal, we aim to address this gap, focusing on the language coverage of Linked Open Data knowledge graphs (LOD KGs). First, we identify key variables that characterize language distribution in LOD, including the number of Wikipedia articles per language edition and the number of language-tagged entities in LOD KGs.