AI RESEARCH

On Global Convergence Rates for Federated Softmax Policy Gradient under Heterogeneous Environments

arXiv cs.LG

arXiv:2505.23459v2 Announce Type: replace We provide global convergence rates for vanilla and entropy-regularized federated softmax stochastic policy gradient (FedPG) with local