AI RESEARCH

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

arXiv cs.CL

arXiv:2605.19577v1. We present GoLongRL, a fully open-source, capability-oriented post-