FACTS Benchmark Suite: Systematically evaluating the factuality of large language models

DeepMind Blog • December 09, 2025

Systematically evaluating the factuality of large language models with the FACTS Benchmark Suite.