AI RESEARCH

Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding

arXiv CS.LG

ArXi:2604.26779v1 Announce Type: new RL post-