AI coding agents need receipts, not just better prompts

AI coding agents are getting good enough to run real engineering tasks, but not safe enough to run without guardrails. The failure mode is not always dramatic. Sometimes the agent just keeps working. It retries. It rewrites. It spends tokens. It changes files. It says it is done. Then another engineer opens the diff and realizes the agent solved the wrong problem. That creates a new engineering question: Can another engineer audit this run later? That is why I’m building MartinLoop. MartinLoop is an open-source control plane for AI coding agents.