UNCOM: Zero-shot Context-Aware Command Understanding for Tabletop Scenarios
arXiv cs.AI
arXiv:2410.06355v3 Announce Type: replace-cross
This paper presents UNCOM, a novel hybrid framework for interpreting natural human commands in tabletop scenarios. The system integrates multiple sources of information -- speech, gestures, and scene context -- to extract structured, actionable instructions for robots. Addressing the need for general-purpose human-robot interaction in domestic environments, UNCOM is designed for zero-shot operation, without reliance on predefined object models or ...