AI RESEARCH
$\pi$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data
arXiv CS.LG
•
ArXi:2604.14054v1 Announce Type: new Deep search agents have emerged as a promising paradigm for addressing complex information-seeking tasks, but their