AI RESEARCH
Decoupled DiLoCo for Resilient Distributed Pre-training
arXiv cs.CL
arXiv:2604.21428v1 Announce Type: new Modern large-scale language model pre-