Beyond AI advice -- independent aggregation boosts human-AI accuracy

arXiv cs.HCby Julian Berger, Pantelis P. Analytis, Ville Satop\"a\"a, Ralf H. J. M. KurversApril 1, 20262 min read0 views

Source Quiz

arXiv:2603.29866v1 Announce Type: new Abstract: Artificial intelligence (AI) is broadly deployed as an advisor to human decision-makers: AI recommends a decision and a human accepts or rejects the advice. This approach, however, has several limitations: People frequently ignore accurate advice and rely too much on inaccurate advice, and their decision-making skills may deteriorate over time. Here, we compare the AI-as-advisor approach to the hybrid confirmation tree (HCT), an alternative strategy that preserves the independence of human and AI judgments. The HCT elicits a human judgment and an AI judgment independently of each other. If they agree, that decision is accepted. If not, a second human breaks the tie. For the comparison, we used 10 datasets from various domains, including medic

View PDF HTML (experimental)

Abstract:Artificial intelligence (AI) is broadly deployed as an advisor to human decision-makers: AI recommends a decision and a human accepts or rejects the advice. This approach, however, has several limitations: People frequently ignore accurate advice and rely too much on inaccurate advice, and their decision-making skills may deteriorate over time. Here, we compare the AI-as-advisor approach to the hybrid confirmation tree (HCT), an alternative strategy that preserves the independence of human and AI judgments. The HCT elicits a human judgment and an AI judgment independently of each other. If they agree, that decision is accepted. If not, a second human breaks the tie. For the comparison, we used 10 datasets from various domains, including medical diagnostics and misinformation discernment, and a subset of four datasets in which AI also explained its decision. The HCT outperformed the AI-as-advisor approach in all datasets. The HCT also performed better in almost all cases in which AI offered an explanation of its judgment. Using signal detection theory to interpret these results, we find that the HCT outperforms the AI-as-advisor approach because people cannot discriminate well enough between correct and incorrect AI advice. Overall, the HCT is a robust, accurate, and transparent alternative to the AI-as-advisor approach, offering a simple mechanism to tap into the wisdom of hybrid crowds.

Subjects:

Human-Computer Interaction (cs.HC)

Cite as: arXiv:2603.29866 [cs.HC]

(or arXiv:2603.29866v1 [cs.HC] for this version)

https://doi.org/10.48550/arXiv.2603.29866

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Julian Berger [view email] [v1] Tue, 31 Mar 2026 15:20:47 UTC (1,890 KB)

Original source

arXiv cs.HC

https://arxiv.org/abs/2603.29866

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

announcearxiv

ReleasesFresh

Better Connected: tap-and-go travel across trains, trams and buses announced in government's new transport strategy

Strategy sets the direction for a more joined-up transport network that works better for passengers, drivers and communities across England.

UK Gov AI News

1mabout 3 hours ago

AI ToolsFresh

Credible Uncertainty Quantification under Noise and System Model Mismatch

arXiv:2509.03311v5 Announce Type: replace Abstract: State estimators often provide self-assessed uncertainty metrics, such as covariance matrices, whose credibility is critical for downstream tasks. However, these self-assessments can be misleading due to underlying modeling violations like noise model mismatch (NMM) or system model misspecification (SMM). This letter addresses this problem by developing a unified, multi-metric framework that integrates noncredibility index (NCI), negative log-likelihood (NLL), and energy score (ES) metrics, featuring an empirical location test (ELT) to detect system model bias and a directional probing technique that uses the metrics' asymmetric sensitivities to distinguish NMM from SMM. Monte Carlo simulations reveal that the proposed method achieves exc

arXiv eess.SP

1mabout 5 hours ago

ReleasesFresh

Data-Model Co-Driven Continuous Channel Map Construction: A Perceptive Foundation for Embodied Intelligent Agents in 6G Networks

arXiv:2604.01060v1 Announce Type: new Abstract: Future 6G networks will host massive numbers of embodied intelligent agents, which require real-time channel awareness over continuous-space for autonomous decision-making. By pre-obtaining location-specific channel state information (CSI), channel map can be served as a foundational world model for embodied intelligence to achieve wireless channel perception. However, acquiring CSI via measurements is costly, so in practice only sparse observations are available, leaving agents blind to channel conditions at unvisited locations. Meanwhile, purely model-driven channel maps can provide dense CSI but often yields unsatisfactory accuracy and robustness, while purely data-driven interpolation from sparse measurements is computationally prohibitiv

arXiv eess.SP

2mabout 5 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 185 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Releases

ReleasesRecent

PEDIGREE® launches AI tool to facilitate responsible dog adoption in Brazil - Marketing Communication News

PEDIGREE® launches AI tool to facilitate responsible dog adoption in Brazil Marketing Communication News

GNews AI Brazil

1mabout 17 hours ago

ReleasesFresh

Better Connected: tap-and-go travel across trains, trams and buses announced in government's new transport strategy

Strategy sets the direction for a more joined-up transport network that works better for passengers, drivers and communities across England.

UK Gov AI News

1mabout 3 hours ago

ReleasesLive

I Built 25 Cloudflare Workers APIs — Here's What I Learned

Over the past few months, I built and deployed 25 APIs on Cloudflare Workers . All running on the free tier. Total monthly hosting cost: $0 . Here's what I learned about building, deploying, and monetizing utility APIs at scale. The Stack Every API follows the same pattern: worker-api/ ├── src/ │ └── index.js # Single entry point ├── wrangler.toml # Cloudflare config └── package.json No frameworks. No bundlers. Just vanilla JavaScript on Cloudflare Workers. The 25 APIs Here's a sampling of what I built: API Purpose Complexity Readability Score Text analysis (Flesch-Kincaid, SMOG, ARI) Medium QR Code Generator Generate QR codes from text/URL Low Password Generator Cryptographically secure passwords Low Markdown to HTML CommonMark-compliant conversion Medium Color Converter HEX/RGB/HSL conve

DEV Community

3m42 minutes ago

ReleasesLive

Image Optimisation Strategies for Better LCP Scores

On many marketing and product pages, Largest Contentful Paint (LCP) is not abstract. It is a hero photograph, a product shot, or a full-width banner. The metric tracks when that largest visible element finishes rendering; if the element is an image, your optimisation work is mostly bytes, dimensions, and discovery order —not another round of “general speed tips”. This guide assumes you already know what LCP measures. If you need the full picture first, read What Are Core Web Vitals? A Practical Guide for 2026 and LCP, INP, CLS: What Each Core Web Vital Means and How to Fix It . Here we go deep on image-specific strategies that move LCP toward the “good” band (≤ 2.5 seconds in the field), and how to pair them with performance budgets so improvements stick. Start by identifying the real LCP

DEV Community

9m41 minutes ago