llama : rotate activations for better quantization by ggerganov · Pull Request #21038 · ggml-org/llama.cpp

r/LocalLLaMA
Generative AI Open Source AI

Tl;dr better quantization -> smarter models submitted by /u/jacek2023 [link] [comments]