AI RESEARCH

EgoKit: Towards Unified Low-Cost Egocentric Data Collection with Heterogeneous Devices

arXiv CS.CV

ArXi:2605.16797v1 Announce Type: new Egocentric video is increasingly used as a data source for robot learning, activity understanding, and embodied AI research, but collecting it at scale remains fragmented in practice: each candidate host device, such as an Android phone, iPhone, iPad, smart glasses, or extended reality (XR) headset, exposes a different SDK, a different policy on raw camera access, and different limitations on external USB cameras and on-device tracking. Synchronized ego-view and wrist-view capture is therefore typically obtained by either committing to a single.