AI RESEARCH

SAGE: Sign-Adaptive Gradient for Memory-Efficient LLM Optimization

arXiv CS.LG

arXiv:2604.07663v2 Announce Type: replace The AdamW optimizer, while standard for LLM pre