AI RESEARCH

Spectral Compact Training: Pre-Training Large Language Models via Permanent Truncated SVD and Stiefel QR Retraction

arXiv CS.AI

ArXi:2604.00733v1 Announce Type: cross The memory wall remains the primary bottleneck for