AI RESEARCH

Prompt-to-Gesture: Measuring the Capabilities of Image-to-Video Deictic Gesture Generation

arXiv CS.CV

arXiv:2604.14953v1. Gesture recognition research, unlike NLP, continues to face acute data scarcity: progress is constrained by the need for costly human recordings or by image-processing approaches that cannot generate authentic variability in the gestures themselves. Recent advances in image-to-video foundation models have enabled the generation of photorealistic, semantically rich videos guided by natural language.