Running Gemma4 26B A4B on the Rockchip NPU using a custom llama.cpp fork. Impressive results for just 4W of power usage!
r/LocalLLaMA
•
Generative AI
AI Hardware
Open Source AI
Submitted by /u/Inv1si [link] [comments]