AI RESEARCH

Before the Body Moves: Learning Anticipatory Joint Intent for Language-Conditioned Humanoid Control

arXiv CS.CV

ArXi:2605.14417v1 Announce Type: cross Natural language is an intuitive interface for humanoid robots, yet streaming whole-body control requires control representations that are executable now and anticipatory of future physical transitions. Existing language-conditioned humanoid systems typically generate kinematic references that a low-level tracker must repair reactively, or use latent/action policies whose outputs do not explicitly encode upcoming contact changes, transfers, and balance preparation.