AI RESEARCH
Rethinking Neural Network Learning Rates: A Stackelberg Perspective
arXiv CS.LG
arXiv:2605.15530v1 Announce Type: new
Neural networks are typically trained with a single learning rate across all layers. While recent empirical evidence suggests that assigning layer-specific learning rates can accelerate