AI RESEARCH
Effectiveness of Distributed Gradient Descent with Local Steps for Overparameterized Models
arXiv CS.LG
•
ArXi:2412.07971v2 Announce Type: replace