AI RESEARCH
SAGE: A Top-Down Bottom-Up Knowledge-Grounded User Simulator for Multi-turn AGent Evaluation
arXiv CS.CL
•
ArXi:2510.11997v3 Announce Type: replace Evaluating multi-turn interactive agents is challenging due to the need for human assessment. Evaluation with simulated users has been