AI RESEARCH
FACTS Benchmark Suite: Systematically evaluating the factuality of large language models
DeepMind Blog
•
Systematically evaluating the factuality of large language models with the FACTS Benchmark Suite.