AI RESEARCH
Robust Adversarial Policy Optimization Under Dynamics Uncertainty
arXiv CS.LG
arXiv:2604.10974v1 Announce Type: new
Reinforcement learning (RL) policies often fail under dynamics that differ from