AI RESEARCH
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
arXiv CS.CV
•
ArXi:2512.01342v2 Announce Type: replace Large-scale video-text pre