Nemotron 3 Super 120b Claude Distilled
r/LocalLLaMA
•
Generative AI
AI Research
Hello everyone, Just wanted to post my V1 iteration of Nemotron 3 super 120B distilled from the 4.6 3000x dataset. This is a beta for the most part only, ~2.3K examples so far from the 3000x dataset. Planning a V2 with data just can't afford it right now. Would love to hear results and suggestions, in some quick tests it seemed like it worked but let me know if I lobotomized it or not. Available in BF16, FP8, and GGUF (Q4_K_M + Q8_0) submitted by /u/ghgi_ [link] [comments.