AI RESEARCH

ReCap: Lightweight Referential Grounding for Coherent Story Visualization

arXiv CS.CV

ArXi:2604.18575v1 Announce Type: new Story Visualization aims to generate a sequence of images that faithfully depicts a textual narrative that preserve character identity, spatial configuration, and stylistic coherence as the narratives unfold. Maintaining such cross-frame consistency has traditionally relied on explicit memory banks, architectural expansion, or auxiliary language models, resulting in substantial parameter growth and inference overhead. We