Built a training stability monitor that detects instability before your loss curve shows anything — open sourced the core today

r/artificial
Machine Learning AI Research

Been working on a weight divergence trajectory curvature approach to detecting neural network