AI RESEARCH

Pulp Motion: Framing-aware multimodal camera and human motion generation

arXiv cs.CV

arXiv:2510.05097v2 Announce Type: replace-cross

Treating human motion and camera trajectory generation separately overlooks a core principle of cinematography: the tight interplay between actor performance and camera work in screen space. In this paper, we are the first to cast this task as text-conditioned joint generation, aiming to maintain consistent on-screen framing while producing two heterogeneous yet intrinsically linked modalities: human motion and camera trajectories.