If your AI agent can send emails, browse websites, or call tools, I want to test something with you
r/OpenAI
•
Generative AI
Most prompt injection tools check one message at a time. Mine tracks the whole conversation. That matters because the attacks that actually work in production don’t happen in one message. They happen across 8 turns. Each one looks clean. By the time the payload arrives your agent is already primed to execute it. I built Arc Gate to catch this. It’s a runtime governance proxy that sits between your agent and the model API and watches behavioral trajectory across the full session - not just each individual message. As far as I know nobody else is doing this at the proxy level.