Beyond Accuracy: Introducing a Symbolic-Mechanistic Approach to Interpretable Evaluation

ArXi:2603.23517v1 Announce Type: new Accuracy-based evaluation cannot reliably distinguish genuine generalization from shortcuts like memorization, leakage, or brittle heuristics, especially in small-data regimes. In this position paper, we argue for mechanism-aware evaluation that combines task-relevant symbolic rules with mechanistic interpretability, yielding algorithmic pass/fail scores that show exactly where models generalize versus exploit patterns. We nstrate this on NL-to-SQL by