The Artificial Self

LessWrong AI

A new paper and microsite about self-models and identity in AIs: site | arXiv | Twitter

We present an ontology, make some claims, and provide some experimental evidence. In this post, I'll mostly cover the claims and cross-post the conceptual part of the text. You can find the experiments on the site, and we will cover some of the results in a separate post.

Maximally compressed version of the claims

I expect many people to already agree with many of these, or find them second kind of obvious. If you do, you may still find some of the specific arguments interesting.