We have built the first of it's kind interactive blog for matching open-source LLMs to GPUs.

You usually end up digging through Reddit threads to find out if a specific model fits on a single A10G, if you can squeeze it onto consumer cards, or if you have to jump up to a massive bare metal A100 cluster. Most of the "guides" out there are just static, out-of-date tables or dense walls of text. So, we published "Which GPU Runs Which LLM" on the AgentSwarms blog, but we engineered it completely differently.