AI RESEARCH

Theory of Optimal Learning Rate Schedules and Scaling Laws for a Random Feature Model

arXiv CS.LG

ArXi:2602.04774v2 Announce Type: replace-cross Setting the learning rate (LR) for a deep learning model is a critical part of successful