Move to backend sampling for MTP draft path by gaugarg-nv · Pull Request #23287 · ggml-org/llama.cpp
r/LocalLLaMA
•
Generative AI
Open Source AI
Improved MTP performance submitted by /u/jacek2023 [link] [comments]