AI RESEARCH
Omni-Masked Gradient Descent: Memory-Efficient Optimization via Mask Traversal with Improved Convergence
arXiv CS.LG
arXiv:2603.05960v1 Announce Type: new Memory-efficient optimization methods have recently gained increasing attention for scaling full-parameter