
An Isotropic Approach to Efficient Uncertainty Quantification with Gradient Norms

arXiv cs.LG · Nils Grünefeld, Jes Frellsen, Christian Hardmeier · April 1, 2026

arXiv:2603.29466v1 Announce Type: new

Abstract: Existing methods for quantifying predictive uncertainty in neural networks are either computationally intractable for large language models or require access to training data that is typically unavailable. We derive a lightweight alternative through two approximations: a first-order Taylor expansion that expresses uncertainty in terms of the gradient of the prediction and the parameter covariance, and an isotropy assumption on the parameter covariance. Together, these yield epistemic uncertainty as the squared gradient norm and aleatoric uncertainty as the Bernoulli variance of the point prediction, from a single forward-backward pass through an unmodified pretrained model. We justify the isotropy assumption by showing that covariance estimates built from non-training data introduce structured distortions that isotropic covariance avoids, and that theoretical results on the spectral properties of large networks support the approximation at scale. Validation against reference Markov Chain Monte Carlo estimates on synthetic problems shows strong correspondence that improves with model size. We then use the estimates to investigate when each uncertainty type carries useful signal for predicting answer correctness in question answering with large language models, revealing a benchmark-dependent divergence: the combined estimate achieves the highest mean AUROC on TruthfulQA, where questions involve genuine conflict between plausible answers, but falls to near chance on TriviaQA's factual recall, suggesting that parameter-level uncertainty captures a fundamentally different signal than self-assessment methods.
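The two quantities the abstract describes can be sketched in closed form for a toy one-layer logistic model. This is an illustrative reduction, not the paper's implementation: the paper applies the recipe to unmodified pretrained language models via a single forward-backward pass, whereas here the gradient dp/dw = p(1−p)·x is available analytically, so the squared gradient norm (epistemic, under the isotropy assumption) and the Bernoulli variance p(1−p) (aleatoric) can be computed directly. The function name and inputs are hypothetical.

```python
import math

def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))

def gradient_norm_uncertainty(w: list[float], x: list[float]):
    """Toy version of the paper's estimates for p = sigmoid(w . x).

    Under a first-order Taylor expansion with an isotropic parameter
    covariance, epistemic uncertainty reduces to the squared gradient
    norm of the prediction w.r.t. the parameters; aleatoric uncertainty
    is the Bernoulli variance p(1 - p) of the point prediction.
    """
    p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
    aleatoric = p * (1.0 - p)                 # Bernoulli variance of the prediction
    grad = [aleatoric * xi for xi in x]       # dp/dw = p(1-p) * x for this model
    epistemic = sum(g * g for g in grad)      # squared gradient norm ||dp/dw||^2
    return p, epistemic, aleatoric
```

For a large network one would replace the analytic gradient with a single backward pass through the frozen model; the rest of the recipe is unchanged.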

Subjects:

Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Cite as: arXiv:2603.29466 [cs.LG]

(or arXiv:2603.29466v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.29466

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Nils Grünefeld [v1] Tue, 31 Mar 2026 09:13:09 UTC (939 KB)
