AI RESEARCH

FedQueue: Queue-Aware Federated Learning for Cross-Facility HPC Training

arXiv cs.LG

arXiv:2605.02125v1 Announce Type: cross Federated learning (FL) across multiple HPC facilities faces stochastic admission delays from batch schedulers that dominate wall-clock time. Synchronous FL suffers from severe stragglers, while asynchronous FL accumulates stale updates when queues spike. We propose FedQueue, a queue-aware FL protocol that incorporates scheduler delays directly into