Gemma 4 fixes in llama.cpp
r/LocalLLaMA
•
Generative AI
Open Source AI
There have already been opinions that Gemma is bad because it doesn’t work well, but you probably aren’t using the transformers implementation, you’re using llama.cpp. After a model is released, you have to wait at least a few days for all the fixes in llama.cpp, for example:. and maybe there will be more? I had a looping problem in chat, but I also tried doing some stuff in OpenCode (it wasn’t even coding), and there were zero problems. So, probably just like with GLM Flash, a better prompt somehow fixes the overthinking/looping. submitted by /u/jacek2023 [link] [comments.