AI RESEARCH

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

arXiv CS.CV

arXiv:2512.01342v2 Announce Type: replace Large-scale video-text pre