AI RESEARCH

ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop

arXiv CS.LG

ArXi:2605.18746v1 Announce Type: cross Spatial intelligence unfolds through a perception-action loop: agents act to acquire observations, and reason about how observations vary as a function of action. Rather than passively processing what is seen, they actively uncover what is unseen - occluded structure, dynamics, containment, and functionality that cannot be resolved from passive sensing alone. We move beyond prior formulations of spatial intelligence that assume oracle observations by recasting the observer as an actor. We.