MiMo-V2.5-GGUF (preview available)
r/LocalLLaMA
•
Generative AI
Open Source AI
Hi, AesSedai here - I've put up a PR to the text-to-text inference of MiMo V2.5 with llama.cpp (and should also Pro, will work on those quants after finishing V2.5): I've also put some quants up on HF, the Q8_0 as well as my usual MoE-optimized quants (for those unfamiliar, it's basically Q8_0 or Q6_K for most of the model, and quanting the FFNs down