Last week in Generative Image & Video

r/StableDiffusion

I curate a weekly multimodal AI roundup. Here are the open-source image and video highlights from the last week:

Motif-Video 2B
- Open-source 2B DiT: 720p at 121 frames, with one checkpoint for both T2V and I2V.
- 83.76% on VBench Total, the highest among open-source models; it beats Wan2.1-14B with 7x fewer parameters.
- Caveat: Wan2.1-14B still wins on temporal stability and fine human anatomy in blind tests.
- Link: Hugging Face

HY-World 2.0 (Tencent)
- First open-source 3D world model that outputs editable meshes, 3DGS, and point clouds.
- Drops straight into Unity, Unreal, and Blender.