AI RESEARCH
Gradient Compression Beyond Low-Rank: Wavelet Subspaces Compact Optimizer States
arXiv CS.AI
•
ArXi:2501.07237v4 Announce Type: replace-cross Large language models (LLMs) have shown impressive performance across a range of natural language processing tasks. However, their vast number of parameters