Consistent 3D Scenes from Video Diffusion (16 minute read)

TLDR AI
Computer Vision

WorldStereo introduces geometric memory modules that guide video diffusion models to generate camera‑consistent multi‑view videos while enabling 3D reconstruction.