Model-Preserving Adaptive Rounding with YAQA

Together AI Blog
Generative AI

We’re excited to announce YAQA (Yet Another Quantization Algorithm), a new weight-only LLM post-