Last week in Image & Video Generation

r/StableDiffusion
Generative AI AI Research AI Tools

I curate a weekly multimodal AI roundup, here are the open-source image & video highlights from the last week: GlyphPrinter - Accurate Text Rendering for Image Gen Fixes localized spelling errors in AI image generators using Region-Grouped Direct Preference Optimization. Balances artistic styling with accurate text. Open weights. GitHub | Hugging Face SegviGen - 3D Object Segmentation via Colorization Repurposes 3D image generators for precise object segmentation. Uses less than 1% of prior.