AI RESEARCH

Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training

arXiv CS.CV

ArXi:2603.16139v1 Announce Type: new Unified Multimodal Models (UMMs) are often constrained by the pre-