Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechBring state-of-the-art agentic skills to the edge with Gemma 4Google Developers BlogTrump administration appeals ruling that blocked Pentagon action against Anthropic over AI dispute - The Washington PostGNews AI USAThe Corner-StoneLessWrongQuantum-Powered Crypto Mining Is Here—But It Won't Help You Mine BitcoinDecrypt AIv0.20.0-rc1: convert: support new Gemma4 audio_tower tensor naming (#15221)Ollama ReleasesAchieving Single-Digit Microsecond Latency Inference for Capital MarketsNVIDIA Tech BlogService Design in the Age of AI: Why Information Flow Is the New InterfaceMedium AIBringing AI Closer to the Edge and On-Device with Gemma 4NVIDIA Tech Blog5 Ways to Stop Writing Prompts and Start Programming AIMedium AIThe DisplacementMedium AIWorkerMill – open-source AI coding team, multi-expert orchestrationHacker News AI TopBlack Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechBring state-of-the-art agentic skills to the edge with Gemma 4Google Developers BlogTrump administration appeals ruling that blocked Pentagon action against Anthropic over AI dispute - The Washington PostGNews AI USAThe Corner-StoneLessWrongQuantum-Powered Crypto Mining Is Here—But It Won't Help You Mine BitcoinDecrypt AIv0.20.0-rc1: convert: support new Gemma4 audio_tower tensor naming (#15221)Ollama ReleasesAchieving Single-Digit Microsecond Latency Inference for Capital MarketsNVIDIA Tech BlogService Design in the Age of AI: Why Information Flow Is the New InterfaceMedium AIBringing AI Closer to the Edge and On-Device with Gemma 4NVIDIA Tech Blog5 Ways to Stop Writing Prompts and Start Programming AIMedium AIThe DisplacementMedium AIWorkerMill – open-source AI coding team, multi-expert orchestrationHacker News AI Top
AI NEWS HUBbyEIGENVECTOREigenvector

Mitigating Forgetting in Continual Learning with Selective Gradient Projection

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.26671v1 Announce Type: new Abstract: As neural networks are increasingly deployed in dynamic environments, they face the challenge of catastrophic forgetting, the tendency to overwrite previously learned knowledge when adapting to new tasks, resulting in severe performance degradation on earlier tasks. We propose Selective Forgetting-Aware Optimization (SFAO), a dynamic method that regulates gradient directions via cosine similarity and per-layer gating, enabling controlled forgetting while balancing plasticity and stability. SFAO selectively projects, accepts, or discards updates u — Anika Singh, Aayush Dhaulakhandi, Varun Chopade, Likhith Malipati, David Martinez, Kevin Zhu

View PDF HTML (experimental)

Abstract:As neural networks are increasingly deployed in dynamic environments, they face the challenge of catastrophic forgetting, the tendency to overwrite previously learned knowledge when adapting to new tasks, resulting in severe performance degradation on earlier tasks. We propose Selective Forgetting-Aware Optimization (SFAO), a dynamic method that regulates gradient directions via cosine similarity and per-layer gating, enabling controlled forgetting while balancing plasticity and stability. SFAO selectively projects, accepts, or discards updates using a tunable mechanism with efficient Monte Carlo approximation. Experiments on standard continual learning benchmarks show that SFAO achieves competitive accuracy with markedly lower memory cost, a 90$%$ reduction, and improved forgetting on MNIST datasets, making it suitable for resource-constrained scenarios.

Comments: 15 pages, 2 figures, Accepted to the Student Research Workshop at International Joint Conference on Natural Language Processing & Asia-Pacific Chapter of the Association for Computational Linguistics, 2025

Subjects:

Machine Learning (cs.LG); Optimization and Control (math.OC)

Cite as: arXiv:2603.26671 [cs.LG]

(or arXiv:2603.26671v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.26671

arXiv-issued DOI via DataCite

Submission history

From: David Martinez [view email] [v1] Sun, 8 Feb 2026 10:24:35 UTC (563 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Mitigating …researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 152 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!