
Locally Confident, Globally Stuck: The Quality-Exploration Dilemma in Diffusion Language Models

arXiv cs.CL · Liancheng Fang, Aiwei Liu, Henry Peng Zou, Yankai Chen, Enze Ma, Leyi Pan, Chunyu Miao, Wei-Chieh Huang, Xue Liu, Philip S. Yu · April 2, 2026



Abstract: Diffusion large language models (dLLMs) theoretically permit token decoding in arbitrary order, a flexibility that could enable richer exploration of reasoning paths than autoregressive (AR) LLMs. In practice, however, random-order decoding often hurts generation quality. To mitigate this, low-confidence remasking improves single-sample quality (e.g., Pass@$1$) by prioritizing confident tokens, but it also suppresses exploration and limits multi-sample gains (e.g., Pass@$k$), creating a fundamental quality--exploration dilemma. In this paper, we provide a unified explanation of this dilemma. We show that low-confidence remasking improves a myopic proxy for quality while provably constraining the entropy of the induced sequence distribution. To overcome this limitation, we characterize the optimal distribution that explicitly balances quality and exploration, and develop a simple Independent Metropolis--Hastings sampler that approximately targets this distribution during decoding. Experiments across a range of reasoning benchmarks including MATH500, AIME24/25, HumanEval, and MBPP show that our approach yields better exploration-quality tradeoff than both random and low-confidence remasking.
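The abstract's contrast between greedy confidence-based decoding and an independent Metropolis--Hastings (IMH) sampler can be illustrated with a toy sketch. Here the tempered target `pi(i) ∝ conf[i]**(1/tau)` over decoding positions is an illustrative stand-in, not the paper's actual optimal distribution, and all function names are hypothetical:

```python
import math
import random

def greedy_confident_position(conf):
    """Low-confidence remasking: decode the most confident masked
    position next (optimizes a myopic, single-sample quality proxy)."""
    return max(range(len(conf)), key=lambda i: conf[i])

def imh_position(conf, tau=1.0, steps=50, seed=0):
    """Toy independent Metropolis-Hastings sampler over decoding
    positions. Target pi(i) is proportional to conf[i]**(1/tau):
    tau -> 0 recovers the greedy rule above, while large tau approaches
    uniform sampling (maximal exploration). With a uniform proposal,
    the acceptance probability reduces to min(1, pi(new) / pi(cur))."""
    rng = random.Random(seed)
    log_pi = [math.log(c) / tau for c in conf]  # unnormalized log target
    cur = rng.randrange(len(conf))              # arbitrary starting position
    for _ in range(steps):
        prop = rng.randrange(len(conf))         # independent uniform proposal
        accept = math.exp(min(0.0, log_pi[prop] - log_pi[cur]))
        if rng.random() < accept:
            cur = prop                          # accept proposal, else stay
    return cur

conf = [0.05, 0.9, 0.6, 0.3]                    # per-position confidences
print(greedy_confident_position(conf))          # -> 1 (always most confident)
print(imh_position(conf, tau=2.0))              # stochastic, biased toward 1
```

The temperature `tau` plays the role of the quality--exploration knob: the greedy rule collapses the sequence distribution's entropy (hurting Pass@$k$), whereas the IMH target keeps mass on less-confident positions while still favoring confident ones.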

Subjects: Computation and Language (cs.CL)

Cite as: arXiv:2604.00375 [cs.CL]

(or arXiv:2604.00375v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2604.00375

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Liancheng Fang [view email] [v1] Wed, 1 Apr 2026 02:01:30 UTC (314 KB)
