AI RESEARCH

Effectiveness of Distributed Gradient Descent with Local Steps for Overparameterized Models

arXiv CS.LG

ArXi:2412.07971v2 Announce Type: replace