
Bayesian model-averaging stochastic item selection for adaptive testing

arXiv stat.ML | Submitted on 22 Apr 2025 (v1), last revised 31 Mar 2026 (this version, v3) | April 1, 2026



Abstract: Computer Adaptive Testing (CAT) aims to accurately estimate an individual's ability using only a subset of an Item Response Theory (IRT) instrument. Many applications also require diverse item exposure across testing sessions, preventing any single item from being over- or underutilized. In CAT, items are selected sequentially based on a running estimate of a respondent's ability. Prior methods almost universally see item selection through an optimization lens, motivating greedy item selection procedures. While efficient, these deterministic methods tend to have poor item exposure. Existing stochastic methods for item selection are ad hoc, with item sampling weights that lack theoretical justification. We formulate stochastic CAT as a Bayesian model averaging problem. We seek item sampling probabilities, treated in the long-run frequentist sense, that perform optimal model averaging for the ability estimate in a Bayesian sense. The derivation yields an information criterion for optimal stochastic mixing: the expected entropy of the next posterior. We tested our method on seven publicly available psychometric instruments spanning personality, social attitudes, narcissism, and work preferences, in addition to the eight scales of the Work Disability Functional Assessment Battery. Across all instruments, accuracy differences between selection methods at a given test length are varied but minimal relative to the natural noise in ability estimation; however, the stochastic selector achieves full item bank exposure, resolving the longstanding tradeoff between measurement efficiency and item security at negligible accuracy cost.
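To make the criterion named in the abstract concrete, the following is a minimal sketch, not the paper's implementation. It computes the expected entropy of the next ability posterior for each candidate item on a discretized ability grid, then samples an item instead of always picking the best one. Several pieces are assumptions made only for illustration: the two-parameter logistic (2PL) response model, the standard normal prior, the made-up item parameters, and especially the softmax weighting with a `temperature` parameter, since the abstract does not state how the paper maps the criterion to sampling probabilities.

```python
import math
import random

def p_correct(theta, a, b):
    """2PL probability of a positive response (assumed model, not from the paper)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def entropy(dist):
    """Shannon entropy of a discrete distribution, in nats."""
    return -sum(p * math.log(p) for p in dist if p > 0)

def expected_next_entropy(posterior, grid, a, b):
    """Expected entropy of the ability posterior after administering one item."""
    p1 = [p_correct(t, a, b) for t in grid]
    total = 0.0
    for response in (1, 0):
        like = p1 if response == 1 else [1.0 - p for p in p1]
        joint = [w * l for w, l in zip(posterior, like)]
        marginal = sum(joint)                   # predictive probability of this response
        updated = [j / marginal for j in joint] # Bayes update on the grid
        total += marginal * entropy(updated)
    return total

def selection_probabilities(posterior, grid, items, temperature=1.0):
    """Hypothetical softmax weighting: lower expected entropy gets higher probability."""
    scores = [expected_next_entropy(posterior, grid, a, b) for a, b in items]
    best = min(scores)
    weights = [math.exp(-(s - best) / temperature) for s in scores]
    z = sum(weights)
    return [w / z for w in weights]

# Discretized standard normal prior over ability (an assumption for the example).
grid = [-4 + 0.05 * i for i in range(161)]
prior = [math.exp(-0.5 * t * t) for t in grid]
norm = sum(prior)
prior = [p / norm for p in prior]

# Made-up (discrimination, difficulty) pairs for three candidate items.
items = [(1.5, 0.0), (1.0, 2.0), (0.5, -3.0)]
probs = selection_probabilities(prior, grid, items)

# Stochastic selection: every item keeps a nonzero chance of being administered.
rng = random.Random(0)
next_item = rng.choices(range(len(items)), weights=probs, k=1)[0]
```

A greedy selector would always take the item with the lowest expected entropy. Sampling from `probs` instead is what spreads exposure across the item bank, and in this sketch a smaller `temperature` moves the behavior back toward the greedy choice.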

Comments: Under review; major revision

Subjects: Methodology (stat.ME); Information Theory (cs.IT); Machine Learning (stat.ML)

Cite as: arXiv:2504.15543 [stat.ME] (or arXiv:2504.15543v3 [stat.ME] for this version)

DOI: https://doi.org/10.48550/arXiv.2504.15543 (arXiv-issued DOI via DataCite)

Submission history

From: Joshua Chang

[v1] Tue, 22 Apr 2025 02:45:16 UTC (4,297 KB)

[v2] Mon, 17 Nov 2025 19:54:39 UTC (4,604 KB)

[v3] Tue, 31 Mar 2026 03:16:30 UTC (3,221 KB)

Original source: arXiv stat.ML, https://arxiv.org/abs/2504.15543




