Research Papers research paper arxiv machine-learning deep-learning

Unrestrained Simplex Denoising for Discrete Data. A Non-Markovian Approach Applied to Graph Generation

arXivMarch 31, 202610 min read0 views

arXiv:2603.28572v1 Announce Type: new Abstract: Denoising models such as Diffusion or Flow Matching have recently advanced generative modeling for discrete structures, yet most approaches either operate directly in the discrete state space, causing abrupt state changes. We introduce simplex denoising, a simple yet effective generative framework that operates on the probability simplex. The key idea is a non-Markovian noising scheme in which, for a given clean data point, noisy representations at different times are conditionally independent. While preserving the theoretical guarantees of denoi — Yoann Boget, Alexandros Kalousis

View PDF HTML (experimental)

Abstract:Denoising models such as Diffusion or Flow Matching have recently advanced generative modeling for discrete structures, yet most approaches either operate directly in the discrete state space, causing abrupt state changes. We introduce simplex denoising, a simple yet effective generative framework that operates on the probability simplex. The key idea is a non-Markovian noising scheme in which, for a given clean data point, noisy representations at different times are conditionally independent. While preserving the theoretical guarantees of denoising-based generative models, our method removes unnecessary constraints, thereby improving performance and simplifying the formulation. Empirically, \emph{unrestrained simplex denoising} surpasses strong discrete diffusion and flow-matching baselines across synthetic and real-world graph benchmarks. These results highlight the probability simplex as an effective framework for discrete generative modeling.

Comments: Simplex Denoising

Subjects:

Machine Learning (cs.LG)

Cite as: arXiv:2603.28572 [cs.LG]

(or arXiv:2603.28572v1 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2603.28572

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yoann Boget [view email] [v1] Mon, 30 Mar 2026 15:26:56 UTC (6,496 KB)

Original source

arXiv

https://arxiv.org/abs/2603.28572

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Research PapersRecent

Do Phone-Use Agents Respect Your Privacy?

We study whether phone-use agents respect privacy while completing benign mobile tasks. This question has remained hard to answer because privacy-compliant behavior is not operationalized for phone-use agents, and ordinary apps do not reveal exactly what data agents type into which form entries during execution. To make this question measurable, we introduce MyPhoneBench, a verifiable evaluation framework for privacy behavior in mobile agents. We operationalize privacy-respecting phone use as pe... (3 upvotes on HuggingFace)

HuggingFace Papers

2m1 day ago

Research Papers

Friends and Grandmothers in Silico: Localizing Entity Cells in Language Models

Entity-centric factual question answering involves localized MLP neurons that can be causally intervened to recover entity-consistent predictions, showing robustness to various linguistic variations but with limited universality across all entities. (0 upvotes on HuggingFace)

HuggingFace Papers

2m2 days ago

Research PapersRecent

Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning

Apriel-Reasoner is a 15B-parameter language model trained with reproducible multi-domain reinforcement learning to improve reasoning efficiency and accuracy across diverse tasks while reducing inference costs. (1 upvotes on HuggingFace)

HuggingFace Papers

2m1 day ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 203 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersRecent

Do Phone-Use Agents Respect Your Privacy?

HuggingFace Papers

2m1 day ago

Research Papers

Friends and Grandmothers in Silico: Localizing Entity Cells in Language Models

HuggingFace Papers

2m2 days ago

Research PapersRecent

Automatic Image-Level Morphological Trait Annotation for Organismal Images

Sparse autoencoders trained on foundation-model features produce monosemantic neurons that enable scalable extraction of morphological traits from biological images through a modular annotation pipeline. (1 upvotes on HuggingFace)

HuggingFace Papers

2m1 day ago

Research Papers

Executing as You Generate: Hiding Execution Latency in LLM Code Generation

Parallel execution paradigm for LLM-based coding agents reduces latency by executing code during generation rather than in sequential stages. (1 upvotes on HuggingFace)

HuggingFace Papers

2m2 days ago