Qwen3.5 27B and 35B with 2x AMD 7900 XTX vLLM bench serve results

r/LocalLLaMA

I've enjoyed the recent reports of success running Qwen3.5 on vLLM with multiple AMD GPUs, especially given AMD's dwindling market share these days! Here are some 'bench serve' results from 2x 7900 XTX with the smaller Qwen3.5 models, cyankiwi/Qwen3.5-27B-AWQ-__TECH_PRESERVE_18TECH_PRESERVE_17__ and cyankiwi/Qwen3.5-35B-A3B-AWQ-4bit.