Quoting a member of Anthropic’s alignment-science team
Simon Willison Blog • Generative AI
"The point of the blackmail exercise was to have something to describe to policymakers - results that are visceral enough to land with people, and make misalignment risk actually salient in practice for people who had never thought about it before."

- A member of Anthropic’s alignment-science team, as told to Gideon Lewis-Kraus