AI RESEARCH

SimpleQA Verified: A Reliable Factuality Benchmark to Measure Parametric Knowledge

arXiv cs.CL

arXiv:2509.07968v2 Announce Type: replace