AI SAFETY & ETHICS
Your Causal Variables Are Irreducibly Subjective
LessWrong AI
•
Mechanistic interpretability needs its own shoe leather era. Reproducing the labeling process will matter than reproducing the Github. Crossposted from Communication & Intelligence substack When we try to understand large language models, we like to invoke causality. And who can blame us? Causal inference comes with an impressive toolkit: directed acyclic graphs, potential outcomes, mediation analysis, formal identification results. It feels crisp. It feels reproducible. It feels like science.