I was hyped for Nemotron 3 4B and it completely disappointed me compared to Qwen 3.5 4B

So I have been running some pretty demanding benchmarks on local models lately, and last week I posted results showing that Qwen 3.5 4B Q8 passed every single one of my custom tests. I was genuinely impressed. Then Nemotron 3 4B dropped today and I immediately grabbed the Q8 quant to put it through the same gauntlet. Spoiler: it did not go well. The thing that had me excited about Nemotron was its different architecture, which supposedly allows for much larger context windows. That sounded promising on paper.