AI RESEARCH

GRAZE: Grounded Refinement and Motion-Aware Zero-Shot Event Localization

arXiv CS.CV

ArXi:2604.01383v1 Announce Type: new American football practice generates video at scale, yet the interaction of interest occupies only a brief window of each long, untrimmed clip. Reliable biomechanical analysis, therefore, depends on spatiotemporal localization that identifies both the interacting entities and the onset of contact. We study First Point of Contact (FPOC), defined as the first frame in which a player physically touches a tackle dummy, in unconstrained practice footage with camera motion, clutter, multiple similarly equipped athletes, and rapid pose changes around impact.