Last week in Image & Video Generation
r/StableDiffusion
•
Generative AI
AI Research
AI Tools
I curate a weekly multimodal AI roundup, here are the open-source image & video highlights from the last week: GlyphPrinter - Accurate Text Rendering for Image Gen Fixes localized spelling errors in AI image generators using Region-Grouped Direct Preference Optimization. Balances artistic styling with accurate text. Open weights. GitHub | Hugging Face SegviGen - 3D Object Segmentation via Colorization Repurposes 3D image generators for precise object segmentation. Uses less than 1% of prior.