AI SAFETY & ETHICS

Anthropic did not publish a "risk discussion" of Mythos when required by their RSP

LessWrong AI

I and some other people noticed a potential discrepancy in Anthropic's announcement of Claude Mythos. The version of the RSP that was operative over the relevant period of time (3.0) included a section (3.1) that suggested some internal deployments would require Anthropic to publish a discussion of that model's effect on the analysis in their previously-published Risk Reports within 30 days.