LM Studio DGX Spark generation speeds for 23 different models

r/LocalLLaMA
Generative AI AI Hardware Open Source AI AI Research

Salutations lads, I ran 23 different models on my Gigabyte Atom (DGX Spark) in LM Studio to benchmark their generation speeds. Theres no real rhyme or reason to the selection of models other than they’re common ones that I have 🤷‍♂️ Im using LM Studio 4.7 with Cuda 13 llama.cpp (Linux ARM) v2.8.0 I loaded the model with their full context window, other than that i left all the other settings as the default stuff.