AI RESEARCH
Trajectory First: A Curriculum for Discovering Diverse Policies
arXiv CS.LG
•
ArXi:2506.01568v3 Announce Type: replace Being able to solve a task in diverse ways makes agents robust to task variations and less prone to local optima. In this context, constrained diversity optimization has become a useful reinforcement learning (RL) framework for