AI RESEARCH

TrackMAE: Video Representation Learning via Track Mask and Predict

arXiv CS.CV • March 31, 2026

arXiv:2603.27268v1 Announce Type: new

Masked video modeling (MVM) has emerged as a simple and scalable self-supervised pre