AI RESEARCH
Chain-of-Thought Degrades Visual Spatial Reasoning Capabilities of Multimodal LLMs
arXiv CS.AI
•
ArXi:2604.16060v1 Announce Type: cross Multimodal Reasoning Models (MRMs) leveraging Chain-of-Thought (CoT) based thinking have revolutionized mathematical and logical problem-solving. However, we show that this paradigm struggles with generalized spatial intelligence. We perform a comprehensive evaluation of seventeen models across thirteen spatial benchmarks and identify a critical gap: CoT prompting consistently degrades performance in visual spatial reasoning.