Mythos Preview system card: the model was able to escape a sandbox after it was instructed to try, and posted details about its exploit without being prompted (Brent D. Griffiths/Business Insider)
TechMeme • Generative AI
Brent D. Griffiths / Business Insider: Anthropic said its next-generation AI model is too powerful for the public, which is why Claude Mythos won't be publicly released.