AI RESEARCH

EgoMind: Activating Spatial Cognition through Linguistic Reasoning in MLLMs

arXiv cs.CV

arXiv:2604.03318v1 Announce Type: new Multimodal large language models (MLLMs) are increasingly applied to spatial cognition tasks, where they are expected to understand and interact with complex environments. Most existing works improve spatial reasoning by