Model-Preserving Adaptive Rounding with YAQA
Together AI Blog
•
Generative AI
We’re excited to announce YAQA (Yet Another Quantization Algorithm), a new weight-only LLM post-