AI RESEARCH
EggHand: A Multimodal Foundation Model for Egocentric Hand Pose Forecasting
arXiv cs.CV
arXiv:2605.07642v1 Announce Type: new
Forecasting future 3D hand pose sequences from egocentric video is essential for understanding human intention and enabling embodied applications such as AR/VR assistance and human-robot interaction. However, the task remains challenging because egocentric hand motion is driven by complex human intent, exhibits highly dexterous articulations, and is observed under drastic viewpoint shifts induced by ego-motion. In this work, we