AI RESEARCH
SpatialPoint: Spatial-aware Point Prediction for Embodied Localization
arXiv CS.AI
•
ArXi:2603.26690v1 Announce Type: cross Embodied intelligence fundamentally requires a capability to determine where to act in 3D space. We formalize this requirement as embodied localization -- the problem of predicting executable 3D points conditioned on visual observations and language instructions. We instantiate embodied localization with two complementary target types: touchable points, surface-grounded 3D points enabling direct physical interaction, and air points, free-space 3D points specifying placement and navigation goals, directional constraints, or geometric relations.