AI RESEARCH

HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation

arXiv CS.CV

ArXi:2604.28196v1 Announce Type: new Driving world models serve as a pivotal technology for autonomous driving by simulating environmental dynamics. However, existing approaches predominantly focus on future scene generation, often overlooking comprehensive 3D scene understanding. Conversely, while Large Language Models (LLMs) nstrate impressive reasoning capabilities, they lack the capacity to predict future geometric evolution, creating a significant disparity between semantic interpretation and physical simulation.