Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessLetters to Sen. Ed Markey: six autonomous vehicle companies say remote assistants don't directly control vehicles; Tesla says its operators are allowed to do so (Aarian Marshall/Wired)TechmemeAnthropic Just Leaked Claude Code's Source. Here's What It Means for Your Vibe-Coded App.DEV CommunityYou're a slop coder. Autospec is for professionals only.DEV CommunityWhat Happened to CodiumAI? The Rebrand to Qodo ExplainedDEV CommunityAIによる雇用破壊はまだ限定的——だが、従来の指標では本当の影響は見えないCIO MagazineWhat Karpathy's Autoresearch Unlocked for MeDEV CommunityBitcoin enters the public bond market as Moody’s gives a first-of-its-kind crypto deal a ratingCoinDesk AIOpenClaw Creem agentDEV CommunityStock Market Today, March 31: Nvidia Rises on $2 Billion Marvell AI Infrastructure Partnership - The Motley FoolGNews AI NVIDIAVolt Typhoon Weaponized SOHO Routers at Scale — Here's Your Zero-Trust Playbook for the Remote EdgeDEV CommunityDeep Dive into vLLM: How PagedAttention & Continuous Batching Revolutionized LLM InferenceDEV CommunityFour futures of AI: Life sciences - EYGoogle News: AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessLetters to Sen. Ed Markey: six autonomous vehicle companies say remote assistants don't directly control vehicles; Tesla says its operators are allowed to do so (Aarian Marshall/Wired)TechmemeAnthropic Just Leaked Claude Code's Source. Here's What It Means for Your Vibe-Coded App.DEV CommunityYou're a slop coder. Autospec is for professionals only.DEV CommunityWhat Happened to CodiumAI? The Rebrand to Qodo ExplainedDEV CommunityAIによる雇用破壊はまだ限定的——だが、従来の指標では本当の影響は見えないCIO MagazineWhat Karpathy's Autoresearch Unlocked for MeDEV CommunityBitcoin enters the public bond market as Moody’s gives a first-of-its-kind crypto deal a ratingCoinDesk AIOpenClaw Creem agentDEV CommunityStock Market Today, March 31: Nvidia Rises on $2 Billion Marvell AI Infrastructure Partnership - The Motley FoolGNews AI NVIDIAVolt Typhoon Weaponized SOHO Routers at Scale — Here's Your Zero-Trust Playbook for the Remote EdgeDEV CommunityDeep Dive into vLLM: How PagedAttention & Continuous Batching Revolutionized LLM InferenceDEV CommunityFour futures of AI: Life sciences - EYGoogle News: AI

Correcting Auto-Differentiation in Neural-ODE Training

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2306.02192v3 Announce Type: replace Abstract: Does the use of auto-differentiation yield reasonable updates for deep neural networks (DNNs)? Specifically, when DNNs are designed to adhere to neural ODE architectures, can we trust the gradients provided by auto-differentiation? Through mathematical analysis and numerical evidence, we demonstrate that when neural networks employ high-order methods, such as Linear Multistep Methods (LMM) or Explicit Runge-Kutta Methods (ERK), to approximate the underlying ODE flows, brute-force auto-differentiation often introduces artificial oscillations i — Yewei Xu, Shi Chen, Qin Li

View PDF HTML (experimental)

Abstract:Does the use of auto-differentiation yield reasonable updates for deep neural networks (DNNs)? Specifically, when DNNs are designed to adhere to neural ODE architectures, can we trust the gradients provided by auto-differentiation? Through mathematical analysis and numerical evidence, we demonstrate that when neural networks employ high-order methods, such as Linear Multistep Methods (LMM) or Explicit Runge-Kutta Methods (ERK), to approximate the underlying ODE flows, brute-force auto-differentiation often introduces artificial oscillations in the gradients that prevent convergence. In the case of Leapfrog and 2-stage ERK, we propose simple post-processing techniques that effectively eliminates these oscillations, correct the gradient computation and thus returns the accurate updates.

Comments: Accepted for publication in SIAM Journal on Applied Mathematics. This version corresponds to the final draft, prior to copyediting and production

Subjects:

Machine Learning (cs.LG); Numerical Analysis (math.NA)

MSC classes: 65D25 (Primary), 65L06, 90C31 (Secondary)

Cite as: arXiv:2306.02192 [cs.LG]

(or arXiv:2306.02192v3 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2306.02192

arXiv-issued DOI via DataCite

Submission history

From: Yewei Xu [view email] [v1] Sat, 3 Jun 2023 20:34:14 UTC (794 KB) [v2] Wed, 3 Sep 2025 01:05:31 UTC (143 KB) [v3] Fri, 27 Mar 2026 22:56:52 UTC (186 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Correcting …researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 123 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers