AI RESEARCH
PanoWorld: Towards Spatial Supersensing in 360$^\circ$ Panorama World
arXiv CS.AI
•
ArXi:2605.13169v1 Announce Type: cross Multimodal large laboratory models (MLLMs) still struggle with spatial understanding under the dominant perspective-image paradigm, which inherits the narrow field of view of human-like perception. For navigation, robotic search, and 3D scene understanding, 360-degree panoramic sensing offers a form of supersensing by capturing the entire surrounding environment at once. However, existing MLLM pipelines typically decompose panoramas into multiple perspective views, leaving the spherical structure of equirectangular projection (ERP) largely implicit.