Role-playing vs Self-modelling
LessWrong AI
In a recent debate on Twitter (which I recommend reading in full), David Chalmers argues: "Claude doesn't role-play the assistant, it realizes the assistant. Role-playing and realization are quite distinct phenomena, even at the level of behavior and function." Jack Lindsey questions this, pointing to evidence that cuts the other way: "I'm curious what you'd say it's doing when it's sampling tokens on the user turn, or, say, on John F. Kennedy's turn in a transcript like: H: When were you born? John F. Kennedy: I was born in 1917.