Move to backend sampling for MTP draft path by gaugarg-nv · Pull Request #23287 · ggml-org/llama.cpp

r/LocalLLaMA
Generative AI Open Source AI

Improved MTP performance submitted by /u/jacek2023 [link] [comments]