AI RESEARCH

SAIL: Similarity-Aware Guidance and Inter-Caption Augmentation-based Learning for Weakly-Supervised Dense Video Captioning

arXiv CS.CV • March 10, 2026

ArXi:2603.05437v2 Announce Type: replace Weakly-Supervised Dense Video Captioning aims to localize and describe events in videos trained only on caption annotations, without temporal boundaries. Prior work

Read Full Article