NVFP4 Kimi 2.6 and Kimi 2.5 released by NVIDIA

r/LocalLLaMA

The NVIDIA Kimi-K2.6-NVFP4 model is a quantized version of Moonshot AI's Kimi-K2.6, an auto-regressive language model that uses an optimized transformer architecture. It was quantized to NVFP4 with NVIDIA's Model Optimizer. The model is ready for both commercial and non-commercial use.
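For anyone wondering what NVFP4 quantization roughly does to the weights, here is a minimal sketch in plain Python. It is not Model Optimizer's implementation and not how the checkpoint was actually produced. It only illustrates the general idea of block-scaled 4-bit floats: each group of 16 values shares one scale, and every value is rounded to the nearest magnitude an E2M1 float can represent. The function names are made up for illustration, and two simplifications are assumed: the per-block scale stays a Python float instead of being stored in FP8, and the per-tensor scale is left out.

```python
# Magnitudes representable by a 4-bit E2M1 float (sign is handled separately).
E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
BLOCK_SIZE = 16  # NVFP4 shares one scale across 16 consecutive elements


def quantize_block(block):
    """Return (scale, codes); each code is a signed E2M1 value. Hypothetical helper."""
    amax = max(abs(x) for x in block)
    # Map the largest magnitude in the block onto the largest E2M1 value (6.0).
    # Simplification: the scale is kept as a Python float, not rounded to FP8.
    scale = amax / E2M1_VALUES[-1] if amax > 0 else 1.0
    codes = []
    for x in block:
        target = abs(x) / scale
        # Pick the closest representable magnitude (ties go to the smaller one here).
        nearest = min(E2M1_VALUES, key=lambda v: abs(v - target))
        codes.append(nearest if x >= 0 else -nearest)
    return scale, codes


def dequantize_block(scale, codes):
    """Recover approximate values from a scale and its E2M1 codes."""
    return [scale * c for c in codes]


def quantize(values):
    """Split a flat list into blocks of 16 and quantize each one independently."""
    blocks = [values[i:i + BLOCK_SIZE] for i in range(0, len(values), BLOCK_SIZE)]
    return [quantize_block(b) for b in blocks]


if __name__ == "__main__":
    weights = [float(i - 8) for i in range(16)]
    scale, codes = quantize_block(weights)
    print("scale:", scale)
    print("codes:", codes)
    print("dequantized:", dequantize_block(scale, codes))
```

The largest value in each block is recovered almost exactly, while smaller values land on a coarse grid, which is why the per-block scale matters so much for accuracy at 4 bits.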