HyperTokens: Controlling Token Dynamics for Continual Video-Language Understanding

ArXi:2603.06662v1 Announce Type: cross Continual VideoQA with multimodal LLMs is hindered by interference between tasks and the prohibitive cost of storing task-specific prompts. We