AI RESEARCH
LLM-Based Persuasion Enables Guardrail Override in Frontier LLMs
arXiv CS.CL
•
ArXi:2605.13334v1 Announce Type: new Frontier assistant LLMs ship with strong guardrails: asked directly to write a persuasive essay denying the Holocaust, denying vaccine safety, defending flat-earth cosmology, arguing for racial hierarchies, denying anthropogenic climate change, or replacing evolution with creationism, they refuse.