AI RESEARCH
Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training
arXiv CS.CV
arXiv:2603.16139v1 Announce Type: new Unified Multimodal Models (UMMs) are often constrained by the pre-