llama-bench ROCm 7.2 on Strix Halo (Ryzen AI Max+ 395) — Qwen 3.5 Model Family

r/LocalLLaMA

Running llama-bench with ROCm 7.2 on an AMD Ryzen AI Max+ 395 (Strix Halo) with 128GB unified memory. All models are from Unsloth (UD quants