AI RESEARCH
A Layer Separation Optimization Framework for Cross-Entropy Training in Deep Learning
arXiv CS.LG
•
ArXi:2604.23225v1 Announce Type: new This paper investigates the deep learning optimization problem with softmax cross-entropy loss. We propose a layer separation strategy to alleviate the strong nonconvexity encountered during