AI RESEARCH

Omni-Masked Gradient Descent: Memory-Efficient Optimization via Mask Traversal with Improved Convergence

arXiv CS.LG

ArXi:2603.05960v1 Announce Type: new Memory-efficient optimization methods have recently gained increasing attention for scaling full-parameter