llama.cpp fixes to run Bonsai 1-bit models on CPU (incl AVX512) and AMD GPUs

r/LocalLLaMA • April 02, 2026


PrismAI's fork of llama.cpp is broken when run on CPU. This post covers fixes for running Bonsai 1-bit models on CPU (including AVX512) and also includes instructions for running on AMD GPUs via ROCm. Submitted by /u/UncleOxidant.
