AI RESEARCH

World2Mind: Cognition Toolkit for Allocentric Spatial Reasoning in Foundation Models

arXiv CS.AI

ArXi:2603.09774v1 Announce Type: new Achieving robust spatial reasoning remains a fundamental challenge for current Multimodal Foundation Models (MFMs). Existing methods either overfit statistical shortcuts via 3D grounding data or remain confined to 2D visual perception, limiting both spatial reasoning accuracy and generalization in unseen scenarios. Inspired by the spatial cognitive mapping mechanisms of biological intelligence, we propose World2Mind, a