AI RESEARCH
SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge
arXiv cs.CL
arXiv:2509.07968v2 Announce Type: replace