Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessBoston Becomes First Major District to Bring AI Literacy Into Classrooms - GoverningGoogle News: AIHow payment fraud evolved from ancient Roman coins to AI-deepfakes — and what's next - The Business JournalsGNews AI deepfakeOracle Lays Off Thousands to Offset AI SpendingGizmodoFranklin Templeton agrees to acquire CoinFund spinoff 250 Digital to form Franklin Crypto, which will offer strategies designed for institutional investors (Vicky Ge Huang/Wall Street Journal)TechmemeDeveloper’s Guide to Building ADK Agents with SkillsGoogle Developers BlogUMW Inaugural AI Expert-in-Residence Shares Insight on Technology’s ‘Tremendous’ Impact - University of Mary WashingtonGoogle News: AISpaceX Said to File Confidentially for IPO Before AI RivalsBloomberg TechnologyCargill Wins 2026 BIG Artificial Intelligence Excellence Award - foodmarket.comGoogle News: AIWhen machines judge without knowing: AI, augmentation and the limits of automated cybersecurity decisions - IAPPGNews AI cybersecurityMeet the Agentic AI Design-to-Source Workspace for PLM: From CAD to Confident Sourcing Decisions - Oracle BlogsGNews AI agenticYouTube blasted by hundreds of experts over ‘AI slop’ videos served up to kidsFast Company TechApono Uses Gamified AI Security Exercise to Engage Cloud Security Community - TipRanksGoogle News: AI SafetyBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessBoston Becomes First Major District to Bring AI Literacy Into Classrooms - GoverningGoogle News: AIHow payment fraud evolved from ancient Roman coins to AI-deepfakes — and what's next - The Business JournalsGNews AI deepfakeOracle Lays Off Thousands to Offset AI SpendingGizmodoFranklin Templeton agrees to acquire CoinFund spinoff 250 Digital to form Franklin Crypto, which will offer strategies designed for institutional investors (Vicky Ge Huang/Wall Street Journal)TechmemeDeveloper’s Guide to Building ADK Agents with SkillsGoogle Developers BlogUMW Inaugural AI Expert-in-Residence Shares Insight on Technology’s ‘Tremendous’ Impact - University of Mary WashingtonGoogle News: AISpaceX Said to File Confidentially for IPO Before AI RivalsBloomberg TechnologyCargill Wins 2026 BIG Artificial Intelligence Excellence Award - foodmarket.comGoogle News: AIWhen machines judge without knowing: AI, augmentation and the limits of automated cybersecurity decisions - IAPPGNews AI cybersecurityMeet the Agentic AI Design-to-Source Workspace for PLM: From CAD to Confident Sourcing Decisions - Oracle BlogsGNews AI agenticYouTube blasted by hundreds of experts over ‘AI slop’ videos served up to kidsFast Company TechApono Uses Gamified AI Security Exercise to Engage Cloud Security Community - TipRanksGoogle News: AI Safety

Optimizing Coverage and Difficulty in Reinforcement Learning for Quiz Composition

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.27695v1 Announce Type: new Abstract: Quiz design is a tedious process that teachers undertake to evaluate the acquisition of knowledge by students. Our goal in this paper is to automate quiz composition from a set of multiple choice questions (MCQs). We formalize a generic sequential decision-making problem with the goal of training an agent to compose a quiz that meets the desired topic coverage and difficulty levels. We investigate DQN, SARSA and A2C/A3C, three reinforcement learning solutions to solve our problem. We run extensive experiments on synthetic and real datasets that s — Ricardo Pedro Querido Andrade Silva, Nassim Bouarour, Dina Fettache, Sarab Boussouar, Noha Ibrahim, Sihem Amer-Yahia

View PDF HTML (experimental)

Abstract:Quiz design is a tedious process that teachers undertake to evaluate the acquisition of knowledge by students. Our goal in this paper is to automate quiz composition from a set of multiple choice questions (MCQs). We formalize a generic sequential decision-making problem with the goal of training an agent to compose a quiz that meets the desired topic coverage and difficulty levels. We investigate DQN, SARSA and A2C/A3C, three reinforcement learning solutions to solve our problem. We run extensive experiments on synthetic and real datasets that study the ability of RL to land on the best quiz. Our results reveal subtle differences in agent behavior and in transfer learning with different data distributions and teacher goals. This was supported by our user study, paving the way for automating various teachers' pedagogical goals.

Subjects:

Machine Learning (cs.LG)

Cite as: arXiv:2603.27695 [cs.LG]

(or arXiv:2603.27695v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.27695

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Noha Ibrahim [view email] [v1] Sun, 29 Mar 2026 13:46:02 UTC (15,000 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Optimizing …researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 81 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers