Evaluating Qwen3.5-35B & 122B on Strix Halo: Bartowski vs. Unsloth UD-XL Performance and Logic Stability
r/LocalLLaMA
Hi, I tested the new Unsloth "dynamic" quants of the 35B and 122B models, with one Bartowski quant for reference. I used a recent llama.cpp build, b8248, and compared the results with tests I ran recently on the older build b8204. The newer build already includes some optimizations merged in b8233, which I recently published. In the diagram you can already see the performance improvement for ROCm, but not so much for Vulkan.