AI RESEARCH

Towards Unified World Models for Visual Navigation via Memory-Augmented Planning and Foresight

arXiv CS.AI

ArXi:2510.08713v2 Announce Type: replace Enabling embodied agents to imagine future states is essential for robust and generalizable visual navigation. Yet, state-of-the-art systems typically rely on modular designs that decouple navigation planning from visual world modeling, which often induces state-action misalignment and weak adaptability in novel or dynamic scenarios. We propose UniWM, a unified, memory-augmented world model that integrates egocentric visual foresight and planning within a single multimodal autoregressive backbone.