One-command local AI stack for AMD Strix Halo
r/LocalLLaMA
•
Generative AI
AI Hardware
Open Source AI
Built an Ansible playbook to turn AMD Strix Halo machines into local AI inference servers Hey all, I've been running local LLMs on my Framework Desktop (AMD Strix Halo, 128 GB unified memory) and wanted a reproducible, one-command setup. So I packaged everything into an Ansible playbook and put it on GitHub. What it does: - Configures Fedora 43 Server on AMD Strix Halo machines (Framework Desktop, GMKtec EVO-X2, etc.) - Installs and configures **llama.cpp** with full GPU offload via ROCm/Vulkan using pre-built toolbox containers (huge thanks to kyuz0 for the amd-strix-halo-toolboxes work.