AI RESEARCH

Survey on Evaluation of LLM-based Agents

arXiv CS.LG

ArXi:2503.16416v2 Announce Type: replace-cross LLM-based agents represent a paradigm shift in AI, enabling autonomous systems to plan, reason, and use tools while interacting with dynamic environments. This paper provides the first comprehensive survey of evaluation methods for these increasingly capable agents.