Multimodal 3D World Model (GitHub Repo)

TLDR AI
Generative AI

Tencent released a multimodal framework that generates and reconstructs 3D worlds from text, images, and video, using a staged pipeline and a unified feed-forward model.