AI RESEARCH
ReCap: Lightweight Referential Grounding for Coherent Story Visualization
arXiv CS.CV
•
ArXi:2604.18575v1 Announce Type: new Story Visualization aims to generate a sequence of images that faithfully depicts a textual narrative that preserve character identity, spatial configuration, and stylistic coherence as the narratives unfold. Maintaining such cross-frame consistency has traditionally relied on explicit memory banks, architectural expansion, or auxiliary language models, resulting in substantial parameter growth and inference overhead. We