When are we going to see natively multimodal local text-image models?

r/StableDiffusion • April 26, 2026

Generative AI

Inputs: img/txt, outputs: img/txt. Predictions please. submitted by /u/wojtulace [link] [comments]