Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessWe found $50k in forgotten subscriptionsDev.to AISMD/飞达 吸嘴、贴片机、物料车、刮刀等耗材对应的场景,以及这些耗材的市场行情,还有对应场景下的经济模式,处在哪个生态位上能够获得比较可观的收益Dev.to AIЯ автоматизировал 80% задач и уволил себя самDev.to AIIs 32GB RAM Enough for Developers in 2026? Or Will It Slow You Down?Medium AIYou can use Google Meet with CarPlay now: How to join meetings safely in your carZDNet Big DataWe Cut Our LLM Inference Bill by 73% Without Degrading Clinical AccuracyMedium AII Think I Found the Best Way to Rank in LLMsMedium AII Tested Gemma 4 on My Laptop and Turned It Into a Free Intelligence Layer for My AI AppsDev.to AIDesigning AI-Powered Event-Driven Systems: When Kafka Meets Intelligent AgentsMedium AIFrom Bit to Being: Why the Next AI Revolution Is Not Technical, but ConsciousMedium AIFrom APIs to AI Agents: How Backend Systems Are Evolving in 2026Medium AIThe AI Ascent and the No-Code Evolution Reshaping Software DevelopmentDev.to AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessWe found $50k in forgotten subscriptionsDev.to AISMD/飞达 吸嘴、贴片机、物料车、刮刀等耗材对应的场景,以及这些耗材的市场行情,还有对应场景下的经济模式,处在哪个生态位上能够获得比较可观的收益Dev.to AIЯ автоматизировал 80% задач и уволил себя самDev.to AIIs 32GB RAM Enough for Developers in 2026? Or Will It Slow You Down?Medium AIYou can use Google Meet with CarPlay now: How to join meetings safely in your carZDNet Big DataWe Cut Our LLM Inference Bill by 73% Without Degrading Clinical AccuracyMedium AII Think I Found the Best Way to Rank in LLMsMedium AII Tested Gemma 4 on My Laptop and Turned It Into a Free Intelligence Layer for My AI AppsDev.to AIDesigning AI-Powered Event-Driven Systems: When Kafka Meets Intelligent AgentsMedium AIFrom Bit to Being: Why the Next AI Revolution Is Not Technical, but ConsciousMedium AIFrom APIs to AI Agents: How Backend Systems Are Evolving in 2026Medium AIThe AI Ascent and the No-Code Evolution Reshaping Software DevelopmentDev.to AI
AI NEWS HUBbyEIGENVECTOREigenvector

Quantifying Confidence in Assurance 2.0 Arguments

arXiv cs.SEby Robin Bloomfield (City St George's, University of London), John Rushby (SRI)April 2, 20261 min read0 views
Source Quiz

arXiv:2604.00034v1 Announce Type: new Abstract: Confidence is central to safety and assurance cases: how much confidence a decision requires and how much the argument actually provides are both important questions. We present a new method for assessing probabilistic confidence in assurance case arguments that is simple, systematic and sound. It exploits the ways claims are decomposed in a structured argument and provides different approaches according to the different degrees of (in)dependence and diversity among subclaims and the way they eliminate concerns that undermine confidence in their parent claims. The method uses only elementary probabilistic constructions that are well-known in other contexts (e.g., Frechet bounds) but we interpret and apply them in a manner that is specifically

View PDF HTML (experimental)

Abstract:Confidence is central to safety and assurance cases: how much confidence a decision requires and how much the argument actually provides are both important questions. We present a new method for assessing probabilistic confidence in assurance case arguments that is simple, systematic and sound. It exploits the ways claims are decomposed in a structured argument and provides different approaches according to the different degrees of (in)dependence and diversity among subclaims and the way they eliminate concerns that undermine confidence in their parent claims. The method uses only elementary probabilistic constructions that are well-known in other contexts (e.g., Frechet bounds) but we interpret and apply them in a manner that is specifically focused on assurance arguments and requires no background in probabilistic analysis. We show that the method is not susceptible to the counterexamples that Graydon and Holloway exhibit for other approaches to confidence and we recommend it as an additional tool in evaluation of Assurance 2.0 arguments. The primary evaluation criteria for Assurance 2.0 remain logical indefeasibility and dialectical examination, but probabilistic assessment can be useful in evaluating cost/confidence tradeoffs for different risk levels, and the overall balance of confidence across a structured argument.

Subjects:

Software Engineering (cs.SE); Logic in Computer Science (cs.LO)

ACM classes: F.3.1; D.2.4

Report number: SRI-CSL-25-01R2

Cite as: arXiv:2604.00034 [cs.SE]

(or arXiv:2604.00034v1 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2604.00034

arXiv-issued DOI via DataCite

Submission history

From: John Rushby [view email] [v1] Sat, 21 Mar 2026 01:54:10 UTC (300 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

announcevaluationanalysis

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Quantifying…announcevaluationanalysissafetyarxivarXiv cs.SE

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 134 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Products