AI RESEARCH
PubSwap: Public-Data Off-Policy Coordination for Federated RLVR
arXiv CS.LG
•
ArXi:2604.12160v1 Announce Type: new Reasoning post-