Chromium AI Image Description Plugin

r/StableDiffusion

Not sure how much use people will get out of this, but I figured I would post it anyway. It uses the Qwen 3.5 LLM workflow in its code, and it can work with both Gemma 3 and Qwen 3.5 models, though I have only listed the official models that I know work. I was not able to verify whether abliterated or other VLM variants work with Comfy, but I can always update the list with those model names as well. I might also just make a model loader that looks for every model with "qwen" or "gemma" in the name. My overall concern was people picking models that don't support vision and then expecting a miracle.
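For anyone curious what the name-based model loader idea could look like, here is a minimal sketch. It is not the plugin's actual code: the function name `find_candidate_models`, the family keywords, and the file extensions are all assumptions made for illustration. It also shows the limitation mentioned above, since matching on the file name says nothing about whether a model actually supports vision.

```python
from pathlib import Path

# Assumed keywords and extensions, chosen for illustration only.
MODEL_FAMILIES = ("qwen", "gemma")
MODEL_EXTENSIONS = (".safetensors", ".gguf")


def find_candidate_models(model_dir, families=MODEL_FAMILIES,
                          extensions=MODEL_EXTENSIONS):
    """Return sorted relative paths of model files whose name contains a family keyword.

    A name match is only a hint: it does not prove the model has vision support.
    """
    root = Path(model_dir)
    if not root.is_dir():
        return []  # missing folder: nothing to list

    matches = []
    for path in root.rglob("*"):  # walk subfolders too
        if not path.is_file():
            continue
        if path.suffix.lower() not in extensions:
            continue
        name = path.name.lower()  # case-insensitive match on the file name
        if any(family in name for family in families):
            matches.append(path.relative_to(root).as_posix())
    return sorted(matches)
```

Calling `find_candidate_models("path/to/models")` (a placeholder path) would return the matching file names for a dropdown. A verified allowlist like the current one is still the safer default, because a loader like this will happily list text-only Qwen or Gemma checkpoints.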