What are the best 40-500 B MoE LLM models now?
r/LocalLLaMA
•
Generative AI
AI Hardware
Open Source AI
Due to old GPU I run on CPU and came to appreciate value of MoE. I know of MoE for Qwen 3.6 and Gemma-4, which are <40B. I want to try some larger models with low number of effective weights. Web search found only posts from 9 month ago, e.g.: Due so many models having been released recently, I assume the info in the above post needs an update. TIA P. S. Right now I have RAM to run models in the lower end of 40-500 range, my primary interest is in 40-100, but setting fixed upper range might prevent good info to be received. submitted by /u/alex20_202020 [link] [comments.