Google's TurboQuant saves memory, but won't save us from DRAM-pricing hell
The Register AI
•
Generative AI
Chocolate Factory’s compression tech clears the way to cheaper AI inference, not affordable memory When Google unveiled TurboQuant, an AI data compression technology that promises to slash the amount of memory required to serve models, many hoped it would help with a memory shortage that has seen prices triple since last year. Not so much