AI RESEARCH

SAGE: A Top-Down Bottom-Up Knowledge-Grounded User Simulator for Multi-turn AGent Evaluation

arXiv CS.CL

ArXi:2510.11997v3 Announce Type: replace Evaluating multi-turn interactive agents is challenging due to the need for human assessment. Evaluation with simulated users has been