AI RESEARCH
Mechanistic Foundations of Goal-Directed Control
arXiv CS.LG
•
ArXi:2603.15248v1 Announce Type: new Mechanistic interpretability has transformed the analysis of transformer circuits by decomposing model behavior into competing algorithms, identifying phase transitions during