AI RESEARCH
ER-Pose: Rethinking Keypoint-Driven Representation Learning for Real-Time Human Pose Estimation
arXiv CS.CV
•
ArXi:2603.08681v1 Announce Type: new Single-stage multi-person pose estimation aims to jointly perform human localization and keypoint prediction within a unified framework, offering advantages in inference efficiency and architectural simplicity. Consequently, multi-scale real-time detection architectures, such as YOLO-like models, are widely adopted for real-time pose estimation. However, these approaches typically inherit a box-driven modeling paradigm from object detection, in which pose estimation is implicitly constrained by bounding-box supervision during