AI RESEARCH

UniGame: Turning a Unified Multimodal Model Into Its Own Adversary

arXiv CS.AI

arXiv:2511.19413v3 Announce Type: replace-cross Unified Multimodal Models (UMMs) have shown impressive performance in both understanding and generation with a single architecture. However, UMMs still exhibit a fundamental inconsistency: understanding favors compact embeddings, whereas generation favors reconstruction-rich representations. This structural trade-off produces misaligned decision boundaries, degraded cross-modal coherence, and heightened vulnerability under distributional and adversarial shifts. In this paper, we present UniGame, a self-adversarial post-