
CoE: Collaborative Entropy for Uncertainty Quantification in Agentic Multi-LLM Systems

Source: arXiv, March 31, 2026

Authors: Kangkang Sun, Jun Wu, Jianhua Li, Minyi Guo, Xiuzhen Che, Jianwei Huang


Abstract: Uncertainty estimation in multi-LLM systems remains largely single-model-centric: existing methods quantify uncertainty within each model but do not adequately capture semantic disagreement across models. To address this gap, we propose Collaborative Entropy (CoE), a unified information-theoretic metric for semantic uncertainty in multi-LLM collaboration. CoE is defined on a shared semantic cluster space and combines two components: intra-model semantic entropy and inter-model divergence to the ensemble mean. CoE is not a weighted ensemble predictor; it is a system-level uncertainty measure that characterizes collaborative confidence and disagreement. We analyze several core properties of CoE, including non-negativity, zero-value certainty under perfect semantic consensus, and the behavior of CoE when individual models collapse to delta distributions. These results clarify when reducing per-model uncertainty is sufficient and when residual inter-model disagreement remains. We also present a simple CoE-guided, training-free post-hoc coordination heuristic as a practical application of the metric. Experiments on TriviaQA and SQuAD with LLaMA-3.1-8B-Instruct, Qwen-2.5-7B-Instruct, and Mistral-7B-Instruct show that CoE provides stronger uncertainty estimation than standard entropy- and divergence-based baselines, with gains becoming larger as additional heterogeneous models are introduced. Overall, CoE offers a useful uncertainty-aware perspective on multi-LLM collaboration.
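The abstract names the two ingredients of CoE but does not give the formula, so the following is only an illustrative sketch and not the authors' definition. It assumes each model's answers have already been mapped to a probability distribution over the same semantic clusters, that models are weighted uniformly, and that "divergence to the ensemble mean" is a KL divergence; the function name collaborative_entropy_sketch and all of these choices are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def entropy(p: Sequence[float]) -> float:
    """Shannon entropy (in nats) of a distribution over semantic clusters."""
    return -sum(x * math.log(x) for x in p if x > 0)


def kl_divergence(p: Sequence[float], q: Sequence[float]) -> float:
    """KL(p || q) in nats; terms with p[j] == 0 contribute nothing."""
    return sum(x * math.log(x / y) for x, y in zip(p, q) if x > 0)


def collaborative_entropy_sketch(
    dists: List[Sequence[float]],
) -> Tuple[float, float]:
    """Return (intra, inter) for a set of per-model cluster distributions.

    ASSUMPTIONS (not taken from the paper): uniform model weights and
    KL divergence as the inter-model term.
    intra = mean per-model semantic entropy
    inter = mean KL divergence from each model to the ensemble mean
    """
    k = len(dists)
    n = len(dists[0])
    # Ensemble mean over the shared cluster space.
    mean = [sum(d[j] for d in dists) / k for j in range(n)]
    intra = sum(entropy(d) for d in dists) / k
    inter = sum(kl_divergence(d, mean) for d in dists) / k
    return intra, inter


if __name__ == "__main__":
    # Perfect semantic consensus: both models put all mass on cluster 0.
    print(collaborative_entropy_sketch([[1.0, 0.0], [1.0, 0.0]]))
    # Each model is individually certain, but they disagree.
    print(collaborative_entropy_sketch([[1.0, 0.0], [0.0, 1.0]]))
```

Under these assumptions both terms are non-negative, and both vanish when every model places all of its mass on the same cluster. When two models collapse to delta distributions on different clusters, the intra-model term is zero but the inter-model term equals log 2 (about 0.693 nats), which is the kind of residual disagreement the abstract says per-model uncertainty reduction cannot remove.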

Comments: 18 pages, 7 figures; published in the ICLR workshop "Agentic AI in the Wild: From Hallucinations to Reliable Autonomy"

Subjects: Artificial Intelligence (cs.AI)

MSC classes: I.2.7, I.2.6

Cite as: arXiv:2603.28360 [cs.AI]

(or arXiv:2603.28360v1 [cs.AI] for this version)

https://doi.org/10.48550/arXiv.2603.28360

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Sun Kangkang
[v1] Mon, 30 Mar 2026 12:28:26 UTC (10,776 KB)
