Built a prompt injection proxy that beats OpenAI Moderation and LlamaGuard — see it block attacks live

r/artificial
Generative AI NLP

Built Arc Gate - sits in front of any OpenAI-compatible endpoint and blocks prompt injection before it reaches your model. Try it here - no signup, no code, no setup: Type any prompt and see if it gets blocked or passes. The examples on the page show the difference. The main detection layer is a behavioral SVM on sentence-transformer embeddings - catches semantic intent, not just pattern matches. Phrase matching is just the fast first pass. Four layers total.