AI RESEARCH
ByteLoom: Weaving Geometry-Consistent Human-Object Interactions through Progressive Curriculum Learning
arXiv CS.LG
•
ArXi:2512.22854v2 Announce Type: replace-cross Human-object interaction (HOI) video generation has garnered increasing attention due to its promising applications in digital humans, e-commerce, advertising, and robotics imitation learning. However, existing methods face two critical limitations: (1) a lack of effective mechanisms to inject multi-view information of the object into the model, leading to poor cross-view consistency, and (2) heavy reliance on fine-grained hand mesh annotations for modeling interaction occlusions. To address these challenges, we