AI RESEARCH

An Optimal Control Approach To Transformer Training

arXiv CS.LG

ArXi:2603.09571v1 Announce Type: new In this paper, we develop a rigorous optimal control-theoretic approach to Transformer