AI RESEARCH

Visually-grounded Humanoid Agents

arXiv CS.CV

ArXi:2604.08509v1 Announce Type: new Digital human generation has been studied for decades and s a wide range of real-world applications. However, most existing systems are passively animated, relying on privileged state or scripted control, which limits scalability to novel environments. We instead ask: how can digital humans actively behave using only visual observations and specified goals in novel scenes? Achieving this would enable populating any 3D environments with digital humans at scale that exhibit spontaneous, natural, goal-directed behaviors. To this end, we