AI RESEARCH

Real2Sim in HOI: Toward Physically Plausible HOI Reconstruction from Monocular Videos

arXiv CS.CV

ArXi:2605.14462v1 Announce Type: new Recovering 4D human-object interaction (HOI) from monocular video is a key step toward scalable 3D content creation, embodied AI, and simulation-based learning. Recent methods can reconstruct temporally coherent human and object trajectories, but these trajectories often remain visual artifacts while failing to preserve stable contact, functional manipulation, or physical plausibility when used as reference motions for humanoid-object simulation.