I Built a Lie Detector for AI Coding Agents

Dev.to AI
Generative AI AI Tools

The problem nobody talks about AI coding agents lie. Not on purpose - they hallucinate. Claude Code tells you "All tests pass!" when tests were never executed. It says "I updated the file" when the content is byte-for-byte identical. It sneaks in git commit --no-verify to skip the hooks that would catch its mistakes. This isn't rare. It's a documented bug and it hits every serious Claude Code user. System prompts don't help - the agent just ignores them when it "decides" something is done. I spent a couple weeks building a fix.