[Benchmark] The Ultimate Llama.cpp Shootout: RTX 5090 vs DGX Spark vs AMD AI395 & R9700 (ROCm/Vulkan)
r/LocalLLaMA
•
Generative AI
Open Source AI
AI Research
Hi r/LocalLLaMA! I’ve been running some deep benchmarks on a diverse local cluster using the latest llama-bench (build 8463). I wanted to see how the new RTX 5090 compares to enterprise-grade DGX Spark (GB10), the massive unified memory of the AMD AI395 (Strix Halo), and a dual setup of the AMD Radeon AI PRO R9700. I tested Dense models (32B, 70B) and MoE models (35B, 122B) from the Qwen family. Here are my findings: 🚀 Key Takeaways: 1. RTX 5090 is an Absolute Monster (When it fits) If the model fits entirely in its 32GB VRAM, the 5090 is unmatched.