Built a tool that stops AI agents from being hijacked by malicious content in webpages and emails
r/artificial
•
Generative AI
If you’ve heard of prompt injection - where hidden instructions in a webpage can take over an AI agent - this is a practical solution for developers deploying agents in production. Arc Gate is a proxy that sits in front of any OpenAI-compatible API. It tracks who is allowed to give instructions to the agent. When a webpage or email tries to issue instructions, it gets treated as untrusted content with zero instruction authority. The agent is protected without the developer having to change anything except the.