Running Gemma4 26B A4B on the Rockchip NPU using a custom llama.cpp fork. Impressive results for just 4W of power usage!

r/LocalLLaMA
Generative AI AI Hardware Open Source AI

Submitted by /u/Inv1si [link] [comments]