Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessI Built an MCP Server That Understands Your MSBuild Project Graph — Before You BuildDEV CommunityAgent Middleware in Microsoft Agent Framework 1.0DEV Communityکود کشاورزی — Complete GuideDEV CommunityHow I Track My AI Spending as a Solo Dev (Without Going Broke)DEV CommunityWe Shipped an AI Song Generator. The Hardest Part Wasn't the AI.DEV CommunityPOTS explained: The disorder that forced OpenAI exec Fidji Simo to take medical leaveBusiness InsiderWhat is POTS, the disorder that forced OpenAI exec Fidji Simo to take medical leave - Business InsiderGoogle News: OpenAI"I Built a Web Browser from Scratch in 42 Days — No Libraries, Just Node.js"DEV CommunityWhy 80% of AI Projects Fail — And the 4-Layer Readiness Framework That Changes the OddsTowards AIWhy Your Data Governance is Already ObsoleteAI YouTube Channel 35How to Clean Up Xcode and Free 30-50GB on Your MacDEV CommunityNo Fooling, Spaceballs 2 Will Hit Theaters April 2027GizmodoBlack Hat USADark ReadingBlack Hat AsiaAI BusinessI Built an MCP Server That Understands Your MSBuild Project Graph — Before You BuildDEV CommunityAgent Middleware in Microsoft Agent Framework 1.0DEV Communityکود کشاورزی — Complete GuideDEV CommunityHow I Track My AI Spending as a Solo Dev (Without Going Broke)DEV CommunityWe Shipped an AI Song Generator. The Hardest Part Wasn't the AI.DEV CommunityPOTS explained: The disorder that forced OpenAI exec Fidji Simo to take medical leaveBusiness InsiderWhat is POTS, the disorder that forced OpenAI exec Fidji Simo to take medical leave - Business InsiderGoogle News: OpenAI"I Built a Web Browser from Scratch in 42 Days — No Libraries, Just Node.js"DEV CommunityWhy 80% of AI Projects Fail — And the 4-Layer Readiness Framework That Changes the OddsTowards AIWhy Your Data Governance is Already ObsoleteAI YouTube Channel 35How to Clean Up Xcode and Free 30-50GB on Your MacDEV CommunityNo Fooling, Spaceballs 2 Will Hit Theaters April 2027Gizmodo
AI NEWS HUBbyEIGENVECTOREigenvector

Sven: Singular Value Descent as a Computationally Efficient Natural Gradient Method

arXivby [Submitted on 1 Apr 2026]April 3, 20262 min read1 views
Source Quiz

arXiv:2604.01279v1 Announce Type: new Abstract: We introduce Sven (Singular Value dEsceNt), a new optimization algorithm for neural networks that exploits the natural decomposition of loss functions into a sum over individual data points, rather than reducing the full loss to a single scalar before computing a parameter update. Sven treats each data point's residual as a separate condition to be satisfied simultaneously, using the Moore-Penrose pseudoinverse of the loss Jacobian to find the minimum-norm parameter update that best satisfies all conditions at once. In practice, this pseudoinvers — Samuel Bright-Thonney, Thomas R. Harvey, Andre Lukas, Jesse Thaler

View PDF HTML (experimental)

Abstract:We introduce Sven (Singular Value dEsceNt), a new optimization algorithm for neural networks that exploits the natural decomposition of loss functions into a sum over individual data points, rather than reducing the full loss to a single scalar before computing a parameter update. Sven treats each data point's residual as a separate condition to be satisfied simultaneously, using the Moore-Penrose pseudoinverse of the loss Jacobian to find the minimum-norm parameter update that best satisfies all conditions at once. In practice, this pseudoinverse is approximated via a truncated singular value decomposition, retaining only the $k$ most significant directions and incurring a computational overhead of only a factor of $k$ relative to stochastic gradient descent. This is in comparison to traditional natural gradient methods, which scale as the square of the number of parameters. We show that Sven can be understood as a natural gradient method generalized to the over-parametrized regime, recovering natural gradient descent in the under-parametrized limit. On regression tasks, Sven significantly outperforms standard first-order methods including Adam, converging faster and to a lower final loss, while remaining competitive with LBFGS at a fraction of the wall-time cost. We discuss the primary challenge to scaling, namely memory overhead, and propose mitigation strategies. Beyond standard machine learning benchmarks, we anticipate that Sven will find natural application in scientific computing settings where custom loss functions decompose into several conditions.

Subjects:

Machine Learning (cs.LG); Artificial Intelligence (cs.AI); High Energy Physics - Theory (hep-th); Optimization and Control (math.OC)

Report number: MIT-CTP/6022

Cite as: arXiv:2604.01279 [cs.LG]

(or arXiv:2604.01279v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2604.01279

arXiv-issued DOI via DataCite

Submission history

From: Thomas Harvey [view email] [v1] Wed, 1 Apr 2026 18:00:07 UTC (254 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Sven: Singu…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 133 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers