APEX MoE quants update: 25+ new models since the Qwen 3.5 post + new I-Nano tier

r/LocalLLaMA
Open Source AI AI Research

Quick follow-up on APEX, the MoE-aware mixed-precision quant strategy. The original post was just about Qwen 3.5 35B-A3B; since then the collection has grown to 30+ MoEs across most major families. Plus a new ultra-compressed tier landed. Feedback so far The reports coming back have been honestly better than I expected! Long context holds up. People report APEX I-Balanced and I-Compact retaining coherence well past 32k tokens on the 30-50B-class MoEs, even at sizes where uniform Q4_K starts visibly degrading.