Research Papers research paper arxiv ai artificial-intelligence

Recent Advances of Multimodal Continual Learning: A Comprehensive Survey

arXivMarch 31, 202610 min read0 views

arXiv:2410.05352v3 Announce Type: replace-cross Abstract: Continual learning (CL) aims to empower machine learning models to learn continually from new data, while building upon previously acquired knowledge without forgetting. As models have evolved from small to large pre-trained architectures, and from supporting unimodal to multimodal data, multimodal continual learning (MMCL) methods have recently emerged. The primary complexity of MMCL is that it extends beyond a simple stacking of unimodal CL methods. Such straightforward approaches often suffer from multimodal catastrophic forgetting, — Dianzhi Yu, Xinni Zhang, Yankai Chen, Aiwei Liu, Yifei Zhang, Philip S. Yu, Irwin King

View PDF

Abstract:Continual learning (CL) aims to empower machine learning models to learn continually from new data, while building upon previously acquired knowledge without forgetting. As models have evolved from small to large pre-trained architectures, and from supporting unimodal to multimodal data, multimodal continual learning (MMCL) methods have recently emerged. The primary complexity of MMCL is that it extends beyond a simple stacking of unimodal CL methods. Such straightforward approaches often suffer from multimodal catastrophic forgetting, yielding unsatisfactory performance. In addition, MMCL introduces new challenges that unimodal CL methods fail to adequately address, including modality imbalance, complex modality interaction, high computational costs, and degradation of pre-trained zero-shot capability of multimodal backbones. In this work, we present the first comprehensive survey on MMCL. We provide essential background knowledge and MMCL settings, as well as a structured taxonomy of MMCL methods. We categorize MMCL methods into four categories, i.e., regularization-based, architecture-based, replay-based, and prompt-based methods, explaining their methodologies and highlighting their key innovations. Additionally, to prompt further research in this field, we summarize open MMCL datasets and benchmarks, provide an in-depth discussion, and discuss several promising future directions. We have also created a GitHub repository for indexing relevant MMCL papers and open resources available at this https URL.

Subjects:

Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

Cite as: arXiv:2410.05352 [cs.LG]

(or arXiv:2410.05352v3 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2410.05352

arXiv-issued DOI via DataCite

Submission history

From: Dianzhi Yu [view email] [v1] Mon, 7 Oct 2024 13:10:40 UTC (2,503 KB) [v2] Fri, 11 Oct 2024 03:50:05 UTC (2,503 KB) [v3] Sat, 28 Mar 2026 11:48:20 UTC (2,568 KB)

Original source

arXiv

https://arxiv.org/abs/2410.05352

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

ModelsLive

"Cognitive surrender" leads AI users to abandon logical thinking, research finds

Article URL: https://arstechnica.com/ai/2026/04/research-finds-ai-users-scarily-willing-to-surrender-their-cognition-to-llms/ Comments URL: https://news.ycombinator.com/item?id=47632504 Points: 5 # Comments: 0

Hacker News AI Top

1m17 minutes ago

ModelsLive

[D] Best websites for pytorch/numpy interviews

Hello, I’m at the last year of my PHD and I’m starting to prepare interviews. I’m mainly aiming at applied scientist/research engineer or research scientist role. For now I’m doing mainly leetcode. I’m looking for websites that can help me train for coding interviews in pytorch/numpy. I did some research and these websites popped up: nexskillai, tensorgym, deep-ml, leetgpu and the torch part of neetcode. However I couldn’t really decide which of these websites are the best. I’m open to suggestions in this matter, thanks. submitted by /u/Training-Adeptness57 [link] [comments]

Reddit r/MachineLearning

1mabout 2 hours ago

Research PapersLive

AI, Price Theory, and the Future of Economics Research

Article URL: https://knowledgeproblem.substack.com/p/ai-price-theory-and-the-future-of Comments URL: https://news.ycombinator.com/item?id=47632631 Points: 1 # Comments: 0

Hacker News AI Top

12m6 minutes ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 128 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersLive

AI, Price Theory, and the Future of Economics Research

Article URL: https://knowledgeproblem.substack.com/p/ai-price-theory-and-the-future-of Comments URL: https://news.ycombinator.com/item?id=47632631 Points: 1 # Comments: 0

Hacker News AI Top

12m6 minutes ago

Research PapersRecent

Label-free pathological subtyping of non-small cell lung cancer using deep classification and virtual immunohistochemical staining

npj Digital Medicine, Published online: 03 April 2026; doi:10.1038/s41746-026-02557-x Label-free pathological subtyping of non-small cell lung cancer using deep classification and virtual immunohistochemical staining

nature.com

1mabout 22 hours ago

Research PapersFresh

First time NeurIPS. How different is it from low-ranked conferences? [D]

I'm a PhD student and already published papers in A/B ranked paper (10+). My field of work never allowed me to work on something really exciting and a core A* conference. But finally after years I think I have work worthy of some discussion at the top venue. I'm referring to papers (my field and top papers) from previous editions and I notice that there's a big difference on how people write, how they put their message on table and also it is too theoretical sometimes. Are there any golden rules people follow who frequently get into these conferences? Should I be soft while making novelty claims? Also those who moved from submitting to niche-conferences to NeurIPS/ICML/CVPR, did you change your approach? My field is imaging in healthcare. submitted by /u/ade17_in [link] [comments]

Reddit r/MachineLearning

1mabout 2 hours ago

Research PapersFresh

Researchers Discover How to Add Psilocybin, DMT, and Other Psychedelics to Tobacco

AI assisted with the study, which could make it cheaper and easier to produce these mind-bending drugs.

Gizmodo

3mabout 2 hours ago