Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models
arXiv:2604.00375v1 Announce Type: new Abstract: Diffusion large language models (dLLMs) theoretically permit token decoding in arbitrary order, a flexibility that could enable richer exploration of reasoning paths than autoregressive (AR) LLMs. In practice, however, random-order decoding often hurts generation quality. To mitigate this, low-confidence remasking improves single-sample quality (e.g., Pass@$1$) by prioritizing confident tokens, but it also suppresses exploration and limits multi-sample gains (e.g., Pass@$k$), creating a fundamental quality--exploration dilemma. In this paper, we provide a unified explanation of this dilemma. We show that low-confidence remasking improves a myopic proxy for quality while provably constraining the entropy of the induced sequence distribution. T
View PDF HTML (experimental)
Abstract:Diffusion large language models (dLLMs) theoretically permit token decoding in arbitrary order, a flexibility that could enable richer exploration of reasoning paths than autoregressive (AR) LLMs. In practice, however, random-order decoding often hurts generation quality. To mitigate this, low-confidence remasking improves single-sample quality (e.g., Pass@$1$) by prioritizing confident tokens, but it also suppresses exploration and limits multi-sample gains (e.g., Pass@$k$), creating a fundamental quality--exploration dilemma. In this paper, we provide a unified explanation of this dilemma. We show that low-confidence remasking improves a myopic proxy for quality while provably constraining the entropy of the induced sequence distribution. To overcome this limitation, we characterize the optimal distribution that explicitly balances quality and exploration, and develop a simple Independent Metropolis--Hastings sampler that approximately targets this distribution during decoding. Experiments across a range of reasoning benchmarks including MATH500, AIME24/25, HumanEval, and MBPP show that our approach yields better exploration-quality tradeoff than both random and low-confidence remasking.
Subjects:
Computation and Language (cs.CL)
Cite as: arXiv:2604.00375 [cs.CL]
(or arXiv:2604.00375v1 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2604.00375
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Liancheng Fang [view email] [v1] Wed, 1 Apr 2026 02:01:30 UTC (314 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
modellanguage modelbenchmark
How to Access All AI Models with a Single API Key in 2026
You want to use GPT-5 for general tasks, Claude for coding, Gemini for long documents, and DeepSeek for cheap inference. That means four API keys, four billing accounts, four different SDKs, and four sets of rate limits to manage. There's a better way. Unified AI API gateways let you access all of these models — and hundreds more — through a single API key and endpoint. This guide shows you exactly how to set it up in under 5 minutes. The Problem with Multiple API Keys If you're calling AI models directly, your setup looks something like this: # The painful way — managing multiple clients import openai import anthropic import google.generativeai as genai openai_client = openai . OpenAI ( api_key = " sk-openai-... " ) anthropic_client = anthropic . Anthropic ( api_key = " sk-ant-... " ) gen

Software Testing Training in Kalyan Nagar – Learnmore Technologies
Launch a successful QA career with Software Testing Training at Learnmore Technologies, Kalyan Nagar. Our industry-focused program covers manual testing, automation testing, Selenium, test case design, and real-time project practice. Learn through hands-on sessions led by experienced trainers. Designed for freshers and professionals, our classroom online training equips you with job-ready testing skills and interview support. Call: 9036542555 Visit: https://learnmoretechnologies.in/software-testing-training-in-kalyan-nagar/
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.


Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!