FINALLY GEMMA 4 KV CACHE IS FIXED

r/LocalLLaMA

YESSS LLAMA.CPP IS UPDATED AND IT DOESN'T TAKE UP PETABYTES OF VRAM

submitted by /u/FusionCow