AI RESEARCH

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

arXiv CS.CV

ArXi:2603.25319v1 Announce Type: new Generating images conditioned on multiple visual references is critical for real-world applications such as multi-subject composition, narrative illustration, and novel view synthesis, yet current models suffer from severe performance degradation as the number of input references grows. We identify the root cause as a fundamental data bottleneck: existing datasets are dominated by single- or few-reference pairs and lack the structured, long-context supervision needed to learn dense inter-reference dependencies. To address this, we.