What Happens When Your AI Agent Goes Rogue

We're building PolicyLayer - open-source policy enforcement for MCP agents. This post is a catalogue of the real failure modes we've seen (and heard about) when agents are given access without constraints. If you're connecting agents to Stripe, GitHub, AWS, or anything with side effects, this one's for you.

When an AI agent goes rogue, it doesn't announce itself. Nobody sets out to build an AI agent that deletes a production database, spends $15,000 on Stripe charges, or opens 200 duplicate GitHub issues.