Your LLM Is Probably Suffocating Your Mac

A step-by-step guide using llmfit + Ollama to host DeepSeek Coder V2 16B on Apple Silicon Running LLMs locally is easier than ever, but most tutorials skip the most important step: figuring out which model actually fits your hardware before you pull it. Download the wrong one and you’re waiting 20 minutes for a model that crawls at 2 tokens per second, or worse, one that doesn’t run at all. In this guide, you’ll use llmfit alongside Ollama to detect your hardware, identify a model that fits well, pull it with a single keystroke, and run it locally,all on a MacBook Pro with Apple Silicon.