AI RESEARCH

Text-to-Stage: Spatial Layouts from Long-form Narratives

arXiv CS.LG

ArXi:2603.17832v1 Announce Type: cross In this work, we probe the ability of a language model to nstrate spatial reasoning from unstructured text, mimicking human capabilities and automating a process that benefits many downstream media applications. Concretely, we study the narrative-to-play task: inferring stage-play layouts (scenes, speaker positions, movements, and room types) from text that lacks explicit spatial, positional, or relational cues. We then