FINALLY GEMMA 4 KV CACHE IS FIXED
r/LocalLLaMA
•
Generative AI
Open Source AI
YESSS LLAMA. CPP IS UPDATED AND IT DOESN'T TAKE UP PETABYTES OF VRAM submitted by /u/FusionCow [link] [comments]