AI SAFETY & ETHICS
Convergent Abstraction Hypothesis
LessWrong AI
•
Tl;dr Convergent abstraction hypothesis posits abstractions are often convergent in the sense of convergent evolution: different cognitive systems converge on the same abstraction, when facing similar selection pressures and learning in similar environments. It is a less ambitious alternative to 'natural abstractions hypotheses' and, in my view, likely to be true. Convergence may be real, useful, and empirically robust, while still being contingent and fragile under changes in architecture