Qwen3.5 reprocessess images in multi-turn. Chat Template bug?

r/LocalLLaMA
Generative AI Open Source AI AI Research

When giving an image to Qwen3-VL and then having a multi turn discussion about it, it does not reprocess the image with each subsequent message. However Qwen3.5 does that exactly. At each new user message it reprocesses the previous images by the vision encoder. Why? Is this a bug in the chat template? I am using unsloths gguf with their chat template, but I also tried the original chat template. Both are doing the same. PS: Oh, an I am using llama.cpp submitted by /u/erazortt [link] [comments]