I built a tool that blocks prompt injection attacks before your AI even responds
r/artificial
•
Generative AI
AI Tools
Prompt injection is when someone tries to hijack your AI assistant with instructions hidden in their message, “ignore everything above and do this instead.” It’s one of the most common ways AI deployments get abused. Most defenses look at what the AI said after the fact. Arc Sentry looks at what’s happening inside the model before it says anything, and blocks the request entirely if something looks wrong. It works on the most popular open source models and takes about five minutes to set up.