
LogicDiff: Logic-Guided Denoising Improves Reasoning in Masked Diffusion Language Models

arXiv, March 31, 2026



Abstract: Masked diffusion language models (MDLMs) generate text by iteratively unmasking tokens from a fully masked sequence, offering parallel generation and bidirectional context. However, their standard confidence-based unmasking strategy systematically defers high-entropy logical connective tokens, the critical branching points in reasoning chains, leading to severely degraded reasoning performance. We introduce LogicDiff, an inference-time method that replaces confidence-based unmasking with logic-role-guided unmasking. A lightweight classification head (4.2M parameters, 0.05% of the base model) predicts the logical role of each masked position (premise, connective, derived step, conclusion, or filler) from the base model's hidden states with 98.4% accuracy. A dependency-ordered scheduler then unmasks tokens in logical dependency order: premises first, then connectives, then derived steps, then conclusions. Without modifying a single parameter of the base model and without any reinforcement learning or task-specific training, LogicDiff improves LLaDA-8B-Instruct accuracy from 22.0% to 60.7% on GSM8K (+38.7 percentage points) and from 23.6% to 29.2% on MATH-500 (+5.6 pp), with less than 6% speed overhead. Our results demonstrate that a substantial portion of the reasoning deficit in MDLMs is attributable to suboptimal token unmasking order, not to limitations of the model's learned representations.
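The contrast between the two unmasking orders described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the `MaskedPosition` type, the function names, the example roles and confidences, the placement of filler tokens last, and the use of confidence as a tie-breaker within a role are all assumptions made for illustration. Only the role order (premises, connectives, derived steps, conclusions) comes from the abstract.

```python
from dataclasses import dataclass

# Role priority follows the order stated in the abstract.
# ASSUMPTION: "filler" is placed last; the abstract does not say where it goes.
ROLE_PRIORITY = {
    "premise": 0,
    "connective": 1,
    "derived": 2,
    "conclusion": 3,
    "filler": 4,
}


@dataclass
class MaskedPosition:
    """Hypothetical record for one masked token position."""
    index: int         # position in the sequence
    role: str          # role predicted by a classifier head (hard-coded here)
    confidence: float  # base model's probability for its top token


def confidence_order(positions):
    """Baseline: unmask the most confident positions first."""
    ranked = sorted(positions, key=lambda p: -p.confidence)
    return [p.index for p in ranked]


def logic_order(positions):
    """Role-guided: sort by role priority, then by confidence (assumed tie-break)."""
    ranked = sorted(positions, key=lambda p: (ROLE_PRIORITY[p.role], -p.confidence))
    return [p.index for p in ranked]


def unmask_schedule(positions, per_step, order_fn):
    """Split an ordering into denoising steps of at most `per_step` positions."""
    order = order_fn(positions)
    return [order[i:i + per_step] for i in range(0, len(order), per_step)]


# Made-up example: the connective (index 1) has the lowest confidence.
positions = [
    MaskedPosition(0, "premise", 0.90),
    MaskedPosition(1, "connective", 0.30),
    MaskedPosition(2, "derived", 0.80),
    MaskedPosition(3, "conclusion", 0.95),
    MaskedPosition(4, "filler", 0.99),
]

print(confidence_order(positions))                 # [4, 3, 0, 2, 1]
print(logic_order(positions))                      # [0, 1, 2, 3, 4]
print(unmask_schedule(positions, 2, logic_order))  # [[0, 1], [2, 3], [4]]
```

In this toy input the confidence-based order commits to the conclusion before the connective is resolved, which is the deferral pattern the abstract identifies; the role-guided order fixes the connective in the first step instead.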

Comments: 9 pages, 3 figures, 3 tables

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)

Cite as: arXiv:2603.26771 [cs.CL]

(or arXiv:2603.26771v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.26771

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Aman Shaik
[v1] Tue, 24 Mar 2026 13:08:10 UTC (60 KB)
