Multimodal 3D World Model (GitHub Repo)
TLDR AI
•
Generative AI
Tencent released a multimodal framework that generated and reconstructed 3D worlds from text, images, and video using a staged pipeline and a unified feed-forward model.