AI RESEARCH

Low-Rank Compression of Pretrained Models via Randomized Subspace Iteration

arXiv CS.AI

ArXi:2604.02659v1 Announce Type: cross The massive scale of pretrained models has made efficient compression essential for practical deployment. Low-rank decomposition based on the singular value decomposition (SVD) provides a principled approach for model reduction, but its exact computation is expensive for large weight matrices. Randomized alternatives such as randomized SVD (RSVD) improve efficiency, yet they can suffer from poor approximation quality when the singular value spectrum decays slowly, a regime commonly observed in modern pretrained models.