Gemma 4 on LocalAI: Vulkan vs ROCm

Hey everyone! 👋 Just finished running a bunch of benchmarks on the new Gemma 4 models using LocalAI and figured I'd share the results. I was curious how the Vulkan and ROCm backends stack up against each other, and how the 26B MoE (only ~4B active params) compares to the full 31B dense model in practice.