ServiceNow-AI/SuperApriel-15B-Instruct · Hugging Face

r/LocalLLaMA
Machine Learning AI Research AI Tools

A 15B-parameter token-mixer supernet with 8 optimized deployment presets spanning 1.0× to 10.7× decode throughput at 32K sequence length, all from a single checkpoint. Derived from Apriel-1.6 through stochastic distillation and targeted supervised fine-tuning.