AI RESEARCH
A Mechanism Study of Delayed Loss Spikes in Batch-Normalized Linear Models
arXiv CS.LG
•
ArXi:2604.16809v1 Announce Type: cross Delayed loss spikes have been reported in neural-network