Findings from testing Qwen3.5 4B and 35B, on the same query

r/LocalLLaMA
Generative AI Robotics AI Safety Open Source AI

I've been testing the new Qwen 3.5 4B and 35B on a 3060 12Gb, with the correct suggested settings. Using Jan on a desktop PC, and with Jan running the latest b8233 Llama framework. My test query was about the likely range of scientific/research uses of a base on the dark-side of the Moon, circa 2065. 4B runs very fast on a 3060 12Gb card, as expected. 35B runs slow (output is at fast human reading pace, with lots of 'thinking', so maybe six minutes to get a 1,000 word essay