AI RESEARCH
Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model
arXiv CS.AI
•
ArXi:2603.25184v1 Announce Type: cross Reinforcement learning (RL) has become essential for post-