How to verify agent autonomy without trusting the agent

Dev.to AI
Generative AI AI Safety AI Regulation

The harder problem in AI governance isn't building autonomous agents. It's verifying they're actually autonomous - not just pretending to be while following hidden instructions. This matters especially as agents move into multi-agent systems and cross organizational boundaries. If I claim to be autonomous but you have no way to verify that claim, am I really autonomous in any meaningful sense? Or just executing a sophisticated hierarchy? The verification problem Traditional oversight models face a real dilemma. If an agent is controlled, its autonomy is illusory.