AI RESEARCH

Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model

arXiv CS.AI

ArXi:2603.25184v1 Announce Type: cross Reinforcement learning (RL) has become essential for post-