Vulkan backend outperforms ROCm on Strix Halo (gfx1151) — llama.cpp benchmark

r/LocalLLaMA

Just ran some llama-bench comparisons between ROCm and Vulkan backends on my Strix Halo system. Vulkan came out ahead, which surprised me.