AI RESEARCH

GradPower: Powering Gradients for Faster Language Model Pre-Training

arXiv CS.LG

ArXi:2505.24275v2 Announce Type: replace We propose GradPower, a lightweight gradient-transformation technique for accelerating language model pre-