Mitigating Forgetting in Continual Learning with Selective Gradient Projection
arXiv:2603.26671v1 Announce Type: new Abstract: As neural networks are increasingly deployed in dynamic environments, they face the challenge of catastrophic forgetting, the tendency to overwrite previously learned knowledge when adapting to new tasks, resulting in severe performance degradation on earlier tasks. We propose Selective Forgetting-Aware Optimization (SFAO), a dynamic method that regulates gradient directions via cosine similarity and per-layer gating, enabling controlled forgetting while balancing plasticity and stability. SFAO selectively projects, accepts, or discards updates u — Anika Singh, Aayush Dhaulakhandi, Varun Chopade, Likhith Malipati, David Martinez, Kevin Zhu
View PDF HTML (experimental)
Abstract:As neural networks are increasingly deployed in dynamic environments, they face the challenge of catastrophic forgetting, the tendency to overwrite previously learned knowledge when adapting to new tasks, resulting in severe performance degradation on earlier tasks. We propose Selective Forgetting-Aware Optimization (SFAO), a dynamic method that regulates gradient directions via cosine similarity and per-layer gating, enabling controlled forgetting while balancing plasticity and stability. SFAO selectively projects, accepts, or discards updates using a tunable mechanism with efficient Monte Carlo approximation. Experiments on standard continual learning benchmarks show that SFAO achieves competitive accuracy with markedly lower memory cost, a 90$%$ reduction, and improved forgetting on MNIST datasets, making it suitable for resource-constrained scenarios.
Comments: 15 pages, 2 figures, Accepted to the Student Research Workshop at International Joint Conference on Natural Language Processing & Asia-Pacific Chapter of the Association for Computational Linguistics, 2025
Subjects:
Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as: arXiv:2603.26671 [cs.LG]
(or arXiv:2603.26671v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2603.26671
arXiv-issued DOI via DataCite
Submission history
From: David Martinez [view email] [v1] Sun, 8 Feb 2026 10:24:35 UTC (563 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxiv
AI Regulation Insights
As Canada s trusted partner in AI advancement, Vector Institute continues to bridge cutting-edge research with practical industry applications through strategic initiatives. In response to the rapidly evolving AI regulatory landscape, [ ] The post AI Regulation Insights appeared first on Vector Institute for Artificial Intelligence .

Thought Cloning: Teaching AI to Think Like Humans for Better Decision-Making
New research from Vector Faculty Member Jeff Clune and Vector Graduate Student Shengran Hu introduces a groundbreaking approach to imitation learning that could potentially revolutionize how we train AI agents. [ ] The post Thought Cloning: Teaching AI to Think Like Humans for Better Decision-Making appeared first on Vector Institute for Artificial Intelligence .

Recommender Systems: Where Academia Meets Industry
Authors: Shaina Raza, Amirmohammad Kazemeini This blog is based on the survey paper “A Comprehensive Review of Recommender Systems.” Recommender Systems (RS) blend artificial intelligence (AI) and personalization in a [ ] The post Recommender Systems: Where Academia Meets Industry appeared first on Vector Institute for Artificial Intelligence .
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers

Quantum computers might crack today's encryption far sooner than we thought
According to a study by engineers at Caltech and the UC Department of Physics, quantum computers do not need to be nearly as powerful as previously believed to crack the most advanced cryptographic technologies. The research claims that Shor's algorithm could break RSA public-key encryption using quantum computers with just... Read Entire Article


Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!