Last week in Generative Image & Video

I curate a weekly multimodal AI roundup, here are the open-source image & video highlights from the last week: GEMS - Closed-loop system for spatial logic and text rendering in image generation. Outperforms Nano Banana 2 on GenEval2. GitHub | Paper ComfyUI Post-Processing Suite - Photorealism suite by thezveroboy. Simulates sensor noise, analog artifacts, and camera metadata with base64 EXIF transfer and calibrated DNG writing. GitHub CutClaw - Open multi-agent video editing framework. Autonomously cuts hours of footage into narrative shorts.