AI RESEARCH
GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
arXiv cs.CL
arXiv:2605.19577v1 Announce Type: new We present GoLongRL, a fully open-source, capability-oriented post-