AI RESEARCH

GenHSI: Controllable Generation of Human-Scene Interaction Videos

arXiv CS.CV

ArXi:2506.19840v2 Announce Type: replace Large-scale pre-trained video diffusion models have exhibited remarkable capabilities in diverse video generation. However, existing solutions face several challenges in generating long videos with rich human-scene interactions (HSI), including unrealistic dynamics and affordance, lack of subject identity preservation, and the need for expensive