AI RESEARCH

PubSwap: Public-Data Off-Policy Coordination for Federated RLVR

arXiv cs.LG

arXiv:2604.12160v1 Announce Type: new Reasoning post-