Last week in Generative Image & Video
r/StableDiffusion
•
NLP
Open Source AI
I curate a weekly multimodal AI roundup, here are the open-source image & video highlights from the last week: DaVinci-MagiHuman - Open-Source Video+Audio Generation 15B single-stream Transformer jointly generating video and audio. Full stack released under Apache 2.0. 80% win rate vs Ovi 1.1, 60.9% vs LTX 2.3 in human eval. 7 languages. 720p at 40 FPS, 5B parameters. Model PSDesigner - Automated Graphic Design Open-source automated graphic design using human-like creative workflow.