AI RESEARCH
Bug or Feature$^2$: Weight Drift, Activation Sparsity, and Spikes
arXiv cs.LG
arXiv:2605.17659v1 Announce Type: new The design of modern neural architectures has converged through incremental empirical choices, yet the mechanisms governing their