Quoting a member of Anthropic’s alignment-science team

Simon Willison Blog
Generative AI

The point of the blackmail exercise was to have something to describe to policymakers - results that are visceral enough to land with people, and make misalignment risk actually salient in practice for people who had never thought about it before.

- A member of Anthropic’s alignment-science team, as told to Gideon Lewis-Kraus