Stereo World Model: Camera-Guided Stereo Video Generation

arXiv CS.CV

arXiv:2603.17375v1. We present StereoWorld, a camera-conditioned stereo world model that jointly learns appearance and binocular geometry for end-to-end stereo video generation. Unlike monocular RGB or RGBD approaches, StereoWorld operates exclusively within the RGB modality while grounding geometry directly in disparity. To efficiently achieve consistent stereo generation, our approach