Seeing the Evidence, Missing the Answer: Tool-Guided Vision-Language Models on Visual Illusions

ArXi:2603.29428v1 Announce Type: new Vision-language models (VLMs) exhibit a systematic bias when confronted with classic optical illusions: they overwhelmingly predict the illusion as "real" regardless of whether the image has been counterfactually modified. We present a tool-guided inference framework for the DataCV 2026 Challenge (Tasks I and II) that addresses this failure mode without any model