OpenCompass: A Universal Evaluation Platform for Large Language Models

ArXi:2605.19276v1 Announce Type: cross In recent years, the field of artificial intelligence has undergone a paradigm shift from task-specific small-scale models to general-purpose large language models (LLMs). With the rapid iteration of LLMs, objective, quantitative, and comprehensive evaluation of their capabilities has become a critical link in advancing technological development.