Google says new TurboQuant compression can lower AI memory usage without sacrificing quality

Ars Technica AI
Generative AI

TurboQuant makes AI models efficient but doesn't reduce output quality like other methods.