Pushing a 5-Year-Old 6GB VRAM laptop to Its Limits: Qwen3.6-35B-A3B
r/LocalLLaMA
•
Generative AI
For the past few weeks, I have been trying to get this model working on my hardware. It still feels incredible how much better open models have become. I couldn't have gotten this model to work on my 5yo laptop if not for this sub and its amazing people. The model is actually usable at ~23 t/s. even getting 10+ t/s when unplugged! It is very good to use with pi agent. If you think this setup can be improved, I'd love to know more. I've documented my full localmaxxing journey on my blog post here, someone might find it helpful.