AI RESEARCH

Thinking with Geometry: Active Geometry Integration for Spatial Reasoning

arXiv CS.CV

ArXi:2602.06037v3 Announce Type: replace Recent progress in spatial reasoning with Multimodal Large Language Models (MLLMs) increasingly leverages geometric priors from 3D encoders. However, most existing integration strategies remain passive: geometry is exposed as a global stream and fused in an indiscriminate manner, which often induces semantic-geometry misalignment and redundant signals. We propose GeoThinker, a framework that shifts the paradigm from passive fusion to active perception.