AI RESEARCH
FedQueue: Queue-Aware Federated Learning for Cross-Facility HPC Training
arXiv cs.LG
arXiv:2605.02125v1 Announce Type: cross Federated learning (FL) across multiple HPC facilities faces stochastic admission delays from batch schedulers that dominate wall-clock time. Synchronous FL suffers from severe stragglers, while asynchronous FL accumulates stale updates when queues spike. We propose FedQueue, a queue-aware FL protocol that incorporates scheduler delays directly into