AI RESEARCH
GenHSI: Controllable Generation of Human-Scene Interaction Videos
arXiv cs.CV
arXiv:2506.19840v2 Announce Type: replace
Large-scale pre-trained video diffusion models have exhibited remarkable capabilities in diverse video generation. However, existing solutions face several challenges in generating long videos with rich human-scene interactions (HSI), including unrealistic dynamics and affordance, lack of subject identity preservation, and the need for expensive