Don't sleep on the new Nemotron Cascade
r/LocalLLaMA
•
Open Source AI
While there has been a lot of discussion regarding the Nemotron Super family of models, I feel like the newest addition, the Nemotron Cascade 2 30B-A3B (which is *not* based on the Qwen architecture despite a similar size, it's a properly hybrid model based on Nemotron's own arch) has largely flown under the radar. I've been running some evals on local models lately since I'm kind of tired of the "vibe feels" method of judging them.