AI RESEARCH
SAGE: Sign-Adaptive Gradient for Memory-Efficient LLM Optimization
arXiv cs.LG
arXiv:2604.07663v2 Announce Type: replace The AdamW optimizer, while standard for LLM pre