Dual 7900 XTX hitting 123 tok/s on Qwen3.5-35B (Vulkan backend)
r/LocalLLaMA
Dual RX 7900 XTX - Qwen3.5-35B-A3B Inference Benchmark
Date: 2026-03-27

Hardware: 2x AMD Radeon RX 7900 XTX (48GB VRAM total, 384-bit GDDR6 per card)
CPU: Ryzen 9 5900XT (16C/32T), 64GB DDR4
OS: Ubuntu 24.04.4 LTS, Kernel 6.17.0-1012-oem
Backend: Vulkan (RADV NAVI31, Mesa), llama.cpp build b8516
Model: Huihui-Qwen3.5-35B-A3B-abliterated.