AI RESEARCH

InfBaGel: Human-Object-Scene Interaction Generation with Dynamic Perception and Iterative Refinement

arXiv CS.AI

ArXi:2604.04843v1 Announce Type: cross Human-object-scene interactions (HOSI) generation has broad applications in embodied AI, simulation, and animation. Unlike human-object interaction (HOI) and human-scene interaction (HSI), HOSI generation requires reasoning over dynamic object-scene changes, yet suffers from limited annotated data. To address these issues, we propose a coarse-to-fine instruction-conditioned interaction generation framework that is explicitly aligned with the iterative denoising process of a consistency model.