Last week in Generative Image & Video

r/StableDiffusion
NLP Open Source AI

I curate a weekly multimodal AI roundup, here are the open-source image & video highlights from the last week: DaVinci-MagiHuman - Open-Source Video+Audio Generation 15B single-stream Transformer jointly generating video and audio. Full stack released under Apache 2.0. 80% win rate vs Ovi 1.1, 60.9% vs LTX 2.3 in human eval. 7 languages. 720p at 40 FPS, 5B parameters. Model PSDesigner - Automated Graphic Design Open-source automated graphic design using human-like creative workflow.