We Published Our AI Guardrail's 37% Detection Rate. Here's What We Learned.
Dev.to AI
•
Generative AI
AI Safety
AI Hardware
AI Research
The numbers We ran NVIDIA's Garak red team scanner against TealTiger, our open-source governance engine for AI agents. Results: Benchmark Score Garak jailbreak 40% detection Garak prompt injection 40% detection Garak data leakage 6.7% detection PINT precision 85.7% PINT recall 40% PINT F1 54.5% Not great. Why publish bad numbers? Because TealTiger's core claim is deterministic, auditable governance.