Findings from testing Qwen3.5 4B and 35B, on the same query
r/LocalLLaMA
I've been testing the new Qwen 3.5 4B and 35B on a 3060 12GB, with the correct suggested settings. I'm using Jan on a desktop PC, with Jan running the latest llama.cpp backend (b8233). My test query was about the likely range of scientific/research uses of a base on the dark side of the Moon, circa 2065. 4B runs very fast on a 3060 12GB card, as expected. 35B runs slow (output is at fast human reading pace, with lots of 'thinking', so maybe six minutes to get a 1,000-word essay