Improving Infinitely Deep Bayesian Neural Networks with Nesterov's Accelerated Gradient Method

arXiv, March 26, 2026

Authors: Chenxu Yu, Wenqi Fang

Abstract: As a representative continuous-depth neural network approach, stochastic differential equation (SDE)-based Bayesian neural networks (BNNs) have attracted considerable attention due to their solid theoretical foundations and strong potential for real-world applications. However, their reliance on numerical SDE solvers inevitably incurs a large number of function evaluations (NFEs), resulting in high computational cost and occasional convergence instability. To address these challenges, we propose a Nesterov-accelerated gradient (NAG) enhanced SDE-BNN model. By integrating NAG into the SDE-BNN framework along with an NFE-dependent residual skip connection, our method accelerates convergence and substantially reduces NFEs during both training and testing. Extensive empirical results show that our model consistently outperforms conventional SDE-BNNs across various tasks, including image classification and sequence modeling, achieving lower NFEs and improved predictive accuracy.
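
To make the cost metric in the abstract concrete, here is a minimal sketch of how function evaluations accumulate in a numerical SDE solver. It assumes a fixed-step Euler-Maruyama scheme and a toy scalar Ornstein-Uhlenbeck process; the names `euler_maruyama`, `drift`, and `diffusion` are hypothetical, and none of this is the solver, model, or NAG formulation used in the paper, which the abstract does not specify.

```python
import math
import random


def euler_maruyama(drift, diffusion, h0, t0, t1, n_steps, rng):
    """Integrate dh = drift(h, t) dt + diffusion(h, t) dW with fixed steps.

    Returns the final state and the number of drift evaluations (NFEs).
    """
    dt = (t1 - t0) / n_steps
    h, t = h0, t0
    nfe = 0
    for _ in range(n_steps):
        # Brownian increment over one step: dW ~ N(0, dt)
        dw = rng.gauss(0.0, math.sqrt(dt))
        h = h + drift(h, t) * dt + diffusion(h, t) * dw
        nfe += 1  # one drift evaluation per solver step
        t += dt
    return h, nfe


# Hypothetical toy dynamics (illustration only, not the paper's network).
def drift(h, t):
    return -h


def diffusion(h, t):
    return 0.1


rng = random.Random(0)
h_coarse, nfe_coarse = euler_maruyama(drift, diffusion, 1.0, 0.0, 1.0, 10, rng)
h_fine, nfe_fine = euler_maruyama(drift, diffusion, 1.0, 0.0, 1.0, 1000, rng)
print(nfe_coarse, nfe_fine)  # prints "10 1000"
```

In an SDE-BNN the drift would be a neural network, so each evaluation is a forward pass, and the NFE count translates directly into compute. In this fixed-step sketch the count simply equals the number of steps; with an adaptive-step solver it instead depends on how difficult the learned dynamics are to integrate.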

Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)

Cite as: arXiv:2603.25024 [stat.ML]

(or arXiv:2603.25024v1 [stat.ML] for this version)

https://doi.org/10.48550/arXiv.2603.25024

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Wenqi Fang
[v1] Thu, 26 Mar 2026 04:42:27 UTC (224 KB)
