We built a public red team environment for our AI agent security proxy — submit attacks and get a full security trace back
r/artificial
•
Generative AI
AI Research
Live adversarial evaluation: Arc Gate is a runtime governance layer for LLM agents. It sits between your app and the OpenAI API and enforces instruction-authority boundaries - tracking who is allowed to instruct the agent and from what source. Webpages, emails, tool outputs, and retrieved documents have zero instruction authority. Submit any attack. Every submission runs against the real proxy and returns a full decision trace, risk score, capability policy, and downloadable JSON report. Confirmed bypasses get documented publicly and patched in the next release.