AI RESEARCH

Schedule-and-Calibrate: Utility-Guided Multi-Task Reinforcement Learning for Code LLMs

arXiv CS.AI

ArXi:2605.06111v1 Announce Type: cross Reinforcement learning (RL) with verifiable rewards has proven effective at post-