MTP is all about acceptance rate

r/LocalLLaMA

So I was very excited about MTP (multi-token prediction), especially since Gemma4 has become my "daily driver" for some tasks. I grabbed the latest mlx-vlm, ran some tests, and found the results disappointing.