As MTP prepares to land in llama.cpp, Models that support MTP

r/LocalLLaMA
Generative AI Open Source AI

DeepSeekv3 OG DeepSeekv3.2/4 Qwen3.5 GLM4.5+ MiniMax2.5+ Step3.5Flash Mimo v2+ Until we get mtp weights, you need to download HF weights and convert to gguf. I think I'm going to try either qwen3.5-122b or glm4.5-air first. submitted by /u/segmond [link] [comments]