AI RESEARCH

Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models

arXiv CS.LG

ArXi:2605.14938v1 Announce Type: new Continual learning in multimodal large language models (MLLMs) aims to sequentially acquire knowledge while mitigating catastrophic forgetting, yet existing methods face inherent limitations: architecture-based approaches incur additional computational overhead and often generalize poorly to new tasks, rehearsal-based methods rely on storing historical data, raising privacy and storage concerns, and conventional regularization-based strategies alone are insufficient to fully prevent parameter interference.