ServiceNow-AI/SuperApriel-15B-Instruct · Hugging Face
r/LocalLLaMA
•
Machine Learning
AI Research
AI Tools
A 15B-parameter token-mixer supernet with 8 optimized deployment presets spanning 1.0× to 10.7× decode throughput at 32K sequence length, all from a single checkpoint. Derived from Apriel-1.6 through stochastic distillation and targeted supervised fine-tuning.