AI RESEARCH

ViSAGE @ NTIRE 2026 Challenge on Video Saliency Prediction

arXiv CS.CV

ArXi:2604.08613v1 Announce Type: new In this report, we present our champion solution for the NTIRE 2026 Challenge on Video Saliency Prediction held in conjunction with CVPR 2026. To exploit complementary inductive biases for video saliency, we propose Video Saliency with Adaptive Gated Experts (ViSAGE), a multi-expert ensemble framework. Each specialized decoder performs adaptive gating and modulation to refine spatio-temporal features. The complementary predictions from different experts are then fused at inference.