AI RESEARCH

Rethinking how we measure AI intelligence

DeepMind Blog

Game Arena is a new, open-source platform for rigorous evaluation of AI models. It allows for head-to-head comparison of frontier systems in environments with clear winning conditions.