AI RESEARCH
Byzantine-Robust Distributed SGD: A Unified Analysis and Tight Error Bounds
arXiv CS.LG
•
ArXi:2604.10179v1 Announce Type: cross Byzantine-robust distributed optimization relies on robust aggregation rules to mitigate the influence of malicious Byzantine workers. Despite the proliferation of such rules, a unified convergence analysis framework that accommodates general data heterogeneity is lacking. In this work, we provide a thorough convergence theory of Byzantine-robust distributed stochastic gradient descent (SGD), analyzing variants both with and without local momentum.