AI RESEARCH

From Instruction to Event: Sound-Triggered Mobile Manipulation

arXiv CS.CV

arXiv:2601.21667v2 Announce Type: replace-cross Current mobile manipulation research predominantly follows an instruction-driven paradigm, where agents rely on predefined textual commands to execute tasks. However, this setting confines agents to a passive role, limiting their autonomy and ability to react to dynamic environmental events. To address these limitations, we