The Wikidata Query Logs Dataset

arXiv CS.CL

arXiv:2602.14594v2 Announce Type: replace. We present the Wikidata Query Logs (WDQL) dataset, which consists of 335k question-query pairs over the Wikidata knowledge graph. It is more than 11x larger than the largest existing Wikidata datasets of similar format, and it does not rely on template-generated queries. Instead, we construct it from real-world SPARQL queries sent to the Wikidata Query Service and generate questions for them.
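
To make "question-query pair" concrete, here is a minimal Python sketch of what one such pair could look like. The `QuestionQueryPair` record, its field names, the `wdqs_url` helper, and the example pair are all assumptions made for illustration; none of them is taken from the dataset, whose actual schema may differ. The only external facts relied on are the public Wikidata Query Service endpoint and the identifiers Q183 (Germany) and P36 (capital).

```python
from dataclasses import dataclass
from urllib.parse import urlencode

# Public SPARQL endpoint of the Wikidata Query Service.
WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


@dataclass(frozen=True)
class QuestionQueryPair:
    """Hypothetical record layout; the dataset's real field names may differ."""
    question: str  # natural-language question
    sparql: str    # SPARQL query that answers it over Wikidata


def wdqs_url(pair: QuestionQueryPair) -> str:
    """Build a GET URL that would run the pair's query and request JSON results."""
    params = urlencode({"query": pair.sparql, "format": "json"})
    return f"{WDQS_ENDPOINT}?{params}"


# Invented example, NOT taken from the dataset.
# wd:Q183 is Germany and wdt:P36 is the "capital" property.
example = QuestionQueryPair(
    question="What is the capital of Germany?",
    sparql="SELECT ?capital WHERE { wd:Q183 wdt:P36 ?capital }",
)

if __name__ == "__main__":
    print(example.question)
    print(wdqs_url(example))
```

The sketch only builds the request URL and performs no network call, so it can be run offline; sending the URL with any HTTP client would execute the query against the live service.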