Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessHigh-Precision OCR for Medical Device Labeling with RF-DETR and Gemini 2.5 FlashRoboflow BlogNvidia’s AI Powerhouse Rally Ignites Fresh Wall Street Hype - TipRanksGNews AI NVIDIAI Asked ChatGPT To Explain Ethereum to Me Like I’m 12 - Yahoo Finance UKGoogle News: ChatGPTOpenAI Called The One Person AI Startup And Three Founders Proved It - ForbesGoogle News: OpenAItrunk/3dcc1a51f1fb1700a975d91d24f44be49f60e45dPyTorch ReleasesAnthropic Just Leaked Its Own AI Secrets. Here’s What It Means for You.Towards AITutorial - How to Toggle On/OFf the Thinking Mode Directly in LM Studio for Any Thinking ModelReddit r/LocalLLaMAApono Amplifies Agentic AI Security Push With New Privilege Guard Product and RSA 2026 Campaign - TipRanksGNews AI agenticThe Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup - FuturismGoogle News: OpenAIDeep Machine Learning - Artificial Neural Network - - TradingViewGoogle News: Machine LearningChinese firms market Iran war intelligence ‘exposing’ U.S. forces - The Washington PostGNews AI military[P] Implemented ACT-R cognitive decay and hyperdimensional computing for AI agent memory (open source)Reddit r/MachineLearningBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessHigh-Precision OCR for Medical Device Labeling with RF-DETR and Gemini 2.5 FlashRoboflow BlogNvidia’s AI Powerhouse Rally Ignites Fresh Wall Street Hype - TipRanksGNews AI NVIDIAI Asked ChatGPT To Explain Ethereum to Me Like I’m 12 - Yahoo Finance UKGoogle News: ChatGPTOpenAI Called The One Person AI Startup And Three Founders Proved It - ForbesGoogle News: OpenAItrunk/3dcc1a51f1fb1700a975d91d24f44be49f60e45dPyTorch ReleasesAnthropic Just Leaked Its Own AI Secrets. Here’s What It Means for You.Towards AITutorial - How to Toggle On/OFf the Thinking Mode Directly in LM Studio for Any Thinking ModelReddit r/LocalLLaMAApono Amplifies Agentic AI Security Push With New Privilege Guard Product and RSA 2026 Campaign - TipRanksGNews AI agenticThe Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup - FuturismGoogle News: OpenAIDeep Machine Learning - Artificial Neural Network - - TradingViewGoogle News: Machine LearningChinese firms market Iran war intelligence ‘exposing’ U.S. forces - The Washington PostGNews AI military[P] Implemented ACT-R cognitive decay and hyperdimensional computing for AI agent memory (open source)Reddit r/MachineLearning
AI NEWS HUBbyEIGENVECTOREigenvector

Understanding SAM's Robustness to Noisy Labels through Gradient Down-weighting

arXivMarch 31, 202610 min read1 views
Source Quiz

arXiv:2411.17132v2 Announce Type: replace Abstract: Sharpness-Aware Minimization (SAM) was introduced to improve generalization by seeking flat minima, yet it also exhibits robustness to label noise, a phenomenon that remains only partially understood. Prior work has mainly attributed this effect to SAM's tendency to prolong the learning of clean samples. In this work, we provide a complementary explanation by analyzing SAM at the element-wise level. We show that when noisy gradients dominate a parameter direction, their influence is reduced by the stronger amplification of clean gradients. Th — Hoang-Chau Luong, Quang-Thuc Nguyen, Dat Ba Tran, Minh-Triet Tran

View PDF HTML (experimental)

Abstract:Sharpness-Aware Minimization (SAM) was introduced to improve generalization by seeking flat minima, yet it also exhibits robustness to label noise, a phenomenon that remains only partially understood. Prior work has mainly attributed this effect to SAM's tendency to prolong the learning of clean samples. In this work, we provide a complementary explanation by analyzing SAM at the element-wise level. We show that when noisy gradients dominate a parameter direction, their influence is reduced by the stronger amplification of clean gradients. This slows the memorization of noisy labels while sustaining clean learning, offering a more complete account of SAM's robustness. Building on this insight, we propose SANER (Sharpness-Aware Noise-Explicit Reweighting), a simple variant of SAM that explicitly magnifies this down-weighting effect. Experiments on benchmark image classification tasks with noisy labels demonstrate that SANER significantly mitigates noisy-label memorization and improves generalization over both SAM and SGD. Moreover, since SANER is designed from the mechanism of SAM, it can also be seamlessly integrated into SAM-like variants, further boosting their robustness.

Subjects:

Machine Learning (cs.LG)

Cite as: arXiv:2411.17132 [cs.LG]

(or arXiv:2411.17132v2 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2411.17132

arXiv-issued DOI via DataCite

Submission history

From: Hoang-Chau Luong [view email] [v1] Tue, 26 Nov 2024 05:54:12 UTC (849 KB) [v2] Mon, 30 Mar 2026 17:14:51 UTC (282 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Understandi…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 189 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers