AI RESEARCH

Mechanistic Foundations of Goal-Directed Control

arXiv cs.LG

arXiv:2603.15248v1 Announce Type: new

Mechanistic interpretability has transformed the analysis of transformer circuits by decomposing model behavior into competing algorithms, identifying phase transitions during