AI SAFETY & ETHICS
Anthropic did not publish a "risk discussion" of Mythos when required by their RSP
LessWrong AI
•
I and some other people noticed a potential discrepancy in Anthropic's announcement of Claude Mythos. The version of the RSP that was operative over the relevant period of time (3.0) included a section (3.1) that suggested some internal deployments would require Anthropic to publish a discussion of that model's effect on the analysis in their previously-published Risk Reports within 30 days.