Mimo2.5 (not pro) under llama.cpp? - primary model opencoder?

r/LocalLLaMA
Generative AI AI Hardware Open Source AI

I tried running AesSedai/MiMo-2.5-GGUF:Q4-K-M under llama.cpp (main tree, compiled 36hours ago) Hardware: nvidia A6000 with 48GB RAM + 300GB CPU RAM I had no success: error loading model: missing tensor blk.0.attn_q.weight. Is Mimo already ed under llama.cpp? From what I read I guessed it runs but is not performnace tweaked yet. Any hints what I did wrong? We started using opencoder. Our primary model is qwen3.6-27b-q8_0 at the moment. Since qwen3.6-122B is not coming I wanted to test alternatives that can be used on the hardware mentioned or on a cluster of 2 x strix or 2 x dgx.