AI RESEARCH
Accelerating RL Post-Training Rollouts via System-Integrated Speculative Decoding
arXiv CS.LG
•
ArXi:2604.26779v1 Announce Type: new RL post-