Philosopher David Chalmers: Current AI interpretability methods miss what matters most

The Decoder
AI Safety

Philosopher David J. Chalmers proposes interpreting AI systems through their attitudes toward propositions - much like we interpret humans. His concept of "propositional interpretability" aims to put mechanistic AI explanation on new footing, drawing on philosophical theories of human understanding.