Conformal Selective Prediction with General Risk Control

Tian Bai, Ying Jin

arXiv, submitted on 25 Mar 2026

Abstract: In deploying artificial intelligence (AI) models, selective prediction offers the option to abstain from making a prediction when uncertain about model quality. To fulfill its promise, it is crucial to enforce strict and precise error control over cases where the model is trusted. We propose Selective Conformal Risk control with E-values (SCoRE), a new framework for deriving such decisions for any trained model and any user-defined, bounded, continuous-valued risk. SCoRE offers two types of guarantees on the risk among "positive" cases, in which the system opts to trust the model. Built upon ideas from conformal inference and hypothesis testing, SCoRE first constructs a class of (generalized) e-values, which are non-negative random variables whose product with the unknown risk has expectation no greater than one. This property is ensured by data exchangeability, without requiring any modeling assumptions. Passing these e-values to hypothesis testing procedures, we obtain binary trust decisions with finite-sample error control. SCoRE avoids the need for uniform concentration and can be readily extended to settings with distribution shifts. We evaluate the proposed methods with simulations and demonstrate their efficacy through applications to error management in drug discovery, health risk prediction, and large language models.
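The abstract says e-values are passed to hypothesis testing procedures to produce binary trust decisions, but it does not name the procedure. As an illustration only, the sketch below implements e-BH, a standard multiple testing procedure that takes e-values as input. Using e-BH here is an assumption, not a statement of what SCoRE does; the function name `e_bh`, the input values, and the level `alpha` are hypothetical, and constructing the e-values themselves is not shown.

```python
def e_bh(e_values, alpha):
    """Generic e-BH procedure (an illustrative stand-in, not SCoRE itself).

    Given m e-values and a target level alpha, select the k* cases with the
    largest e-values, where k* is the largest k such that the k-th largest
    e-value is at least m / (alpha * k). Returns the selected indices, sorted.
    """
    m = len(e_values)
    if m == 0:
        return []
    # Indices ordered from the largest e-value to the smallest.
    order = sorted(range(m), key=lambda i: e_values[i], reverse=True)
    k_star = 0
    for k, idx in enumerate(order, start=1):
        if e_values[idx] >= m / (alpha * k):
            k_star = k  # keep the largest k that meets its threshold
    return sorted(order[:k_star])


# Hypothetical e-values for five test cases; a large value is strong evidence
# that the model can be trusted on that case.
selected = e_bh([50.0, 1.0, 30.0, 0.5, 12.0], alpha=0.2)
print(selected)  # [0, 2, 4]
```

In a selective prediction setting, the selected indices would be the "positive" cases where the model is trusted, and the rest would be abstentions. For ordinary e-values, e-BH controls the false discovery rate at level `alpha`. How the paper's generalized e-values yield its two types of risk guarantee is described in the paper, not in this sketch.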

Subjects: Methodology (stat.ME); Machine Learning (cs.LG); Applications (stat.AP); Machine Learning (stat.ML)

Cite as: arXiv:2603.24704 [stat.ME]

(or arXiv:2603.24704v1 [stat.ME] for this version)

https://doi.org/10.48550/arXiv.2603.24704

arXiv-issued DOI via DataCite

Submission history

From: Ying Jin
[v1] Wed, 25 Mar 2026 18:29:23 UTC (1,655 KB)
