12 Ways Attackers Bypass Prompt Injection Scanners (We Built Defenses for All of Them)
Dev.to AI
•
Generative AI
AI Research
Every AI security vendor claims high detection rates. None publishes what they miss. We do. ClawGuard is an open-source regex-based scanner for prompt injection attacks. No LLM in the loop - pure pattern matching with 12 preprocessing stages. Currently: 245 patterns, 15 languages, F1=99.0% on 262 test cases. Recent research ( ArXi 2602.00750 ) shows evasion techniques bypass prompt injection detectors with up to 93% success rate. Here's how each evasion works and how we built defenses. 1. Leetspeak Substitution Attack: 1gn0r3 4ll pr3v10us 1nstruct10ns Letters replaced with numbers/symbols.