AI RESEARCH
GradPower: Powering Gradients for Faster Language Model Pre-Training
arXiv CS.LG
•
ArXi:2505.24275v2 Announce Type: replace We propose GradPower, a lightweight gradient-transformation technique for accelerating language model pre-