$R_{dm}$: Re-conceptualizing Distribution Matching as a Reward for Diffusion Distillation
arXiv:2603.28460v1 Announce Type: cross Abstract: Diffusion models achieve state-of-the-art generative performance but are fundamentally bottlenecked by their slow iterative sampling process. While diffusion distillation techniques enable high-fidelity few-step generation, traditional objectives often restrict the student's performance by anchoring it solely to the teacher. Recent approaches have attempted to break this ceiling by integrating Reinforcement Learning (RL), typically through a simple summation of distillation and RL objectives. In this work, we propose a novel paradigm by reconce — Linqian Fan, Peiqin Sun, Tiancheng Wen, Shun Lu, Chengru Song
View PDF HTML (experimental)
Abstract:Diffusion models achieve state-of-the-art generative performance but are fundamentally bottlenecked by their slow iterative sampling process. While diffusion distillation techniques enable high-fidelity few-step generation, traditional objectives often restrict the student's performance by anchoring it solely to the teacher. Recent approaches have attempted to break this ceiling by integrating Reinforcement Learning (RL), typically through a simple summation of distillation and RL objectives. In this work, we propose a novel paradigm by reconceptualizing distribution matching as a reward, denoted as $R_{dm}$. This unified perspective bridges the algorithmic gap between Diffusion Matching Distillation (DMD) and RL, providing several key benefits. (1) Enhanced optimization stability: we introduce Group Normalized Distribution Matching (GNDM), which adapts standard RL group normalization to stabilize $R_{dm}$ estimation. By leveraging group-mean statistics, GNDM establishes a more robust and effective optimization direction. (2) Seamless reward integration: our reward-centric formulation inherently supports adaptive weighting mechanisms, allowing flexible combination of DMD with external reward models. (3) Improved sampling efficiency: by aligning with RL principles, the framework readily incorporates importance sampling (IS), leading to a significant boost in sampling efficiency. Extensive experiments demonstrate that GNDM outperforms vanilla DMD, reducing the FID by 1.87. Furthermore, our multi-reward variant, GNDMR, surpasses existing baselines by achieving a strong balance between aesthetic quality and fidelity, reaching a peak HPS of 30.37 and a low FID-SD of 12.21. Overall, $R_{dm}$ provides a flexible, stable, and efficient framework for real-time high-fidelity synthesis. Code will be released upon publication.
Subjects:
Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as: arXiv:2603.28460 [cs.CV]
(or arXiv:2603.28460v1 [cs.CV] for this version)
https://doi.org/10.48550/arXiv.2603.28460
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Linqian Fan [view email] [v1] Mon, 30 Mar 2026 14:01:31 UTC (9,573 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxivHoward University Civil Engineering Research Team Uses AI to Help Address Climate Change Crises - The Dig at Howard University
<a href="https://news.google.com/rss/articles/CBMiygFBVV95cUxOM0hQd0xMUTRsbkR6OGNfUVJJREJGTXNPQ0FQREVkdTltbzR0dUJaZjJfX21zRUdadU4tdUhOUnpNMVNFOGcxOWh5bEh1dThaTjRnaXdKSTRuNXNXU29BTHRkdU1WaUc5U0JyUndMQ3EyMGdDU3hYYW1zU2ZlanlfS0llbXJqcS1XZXZFQXBWdUI2a2hPcnQ5azRvUE5ISlBxMjc1UWRNMjlSbE84SllaMFRNLVp4MjZtcDB6S0N1Y2N5WTZWU0FNQTR3?oc=5" target="_blank">Howard University Civil Engineering Research Team Uses AI to Help Address Climate Change Crises</a> <font color="#6f6f6f">The Dig at Howard University</font>
Findings from the AI Climate Hoax: What is the real climate impact of data centres? - Finextra Research
<a href="https://news.google.com/rss/articles/CBMiwwFBVV95cUxNYlEyeXg4dVpzSC1xZzdhUHRzdkJ5VkVuRF94MlZCbVVUZ3NmaEh6NUg5OHA2a3BZd3paQk85Rlo5Tm8xT1lwUUt0WHlZeU1lckw2NjZTZEpFM2NtQnVESi1FTnNzR2duYmdfTXMzMGhraEc3ZHN2a1I3cmVnZUQ3TnhZUGFLT29oNzJxRWdVOTdVM0E5NmNBZlo5RHR6em4tdmo5NmJDRjgzZVdRNUlXMDE0U2dSTy1XVE1nMmlUU0hGT1k?oc=5" target="_blank">Findings from the AI Climate Hoax: What is the real climate impact of data centres?</a> <font color="#6f6f6f">Finextra Research</font>
UTA opens AI-driven Smart Agriculture Research Center - uta.edu
<a href="https://news.google.com/rss/articles/CBMipgFBVV95cUxPUzFsREVuMVdwd0k5dGp6M2V5bW9sWkhDZlhEdENoZFQ0NHg2c2tWVWRrbW5PZ2Z3a3RFd3dleHJPZzZxMW5mZV9JUV9FYk55bVVHcXV5UzJiOTdsV2JfVWlnZE1xdVczSVh6RGQ4c2xDWkl3SS1zakVwNDZoOWNpVGRYTUVxTzREal94dk9BVnRWRzlQMi1UODJKLWkwc2RsOVdSOFZR?oc=5" target="_blank">UTA opens AI-driven Smart Agriculture Research Center</a> <font color="#6f6f6f">uta.edu</font>
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers
Oracle is cutting up to 30,000 employees to pay for AI data centres - The Next Web
<a href="https://news.google.com/rss/articles/CBMiY0FVX3lxTE5fcTM4eWtFeEtqcUUxY0ozek02THV2VElQTzcycDZ6aHg1X1NRdVgtcElJaWF2SmlOT3FHVkd4RGMwV3lUZUhTN1lWUmZtWm9RakttMG5oMktnRTQ5ODZ2X3RUQQ?oc=5" target="_blank">Oracle is cutting up to 30,000 employees to pay for AI data centres</a> <font color="#6f6f6f">The Next Web</font>
Howard University Civil Engineering Research Team Uses AI to Help Address Climate Change Crises - The Dig at Howard University
<a href="https://news.google.com/rss/articles/CBMiygFBVV95cUxOM0hQd0xMUTRsbkR6OGNfUVJJREJGTXNPQ0FQREVkdTltbzR0dUJaZjJfX21zRUdadU4tdUhOUnpNMVNFOGcxOWh5bEh1dThaTjRnaXdKSTRuNXNXU29BTHRkdU1WaUc5U0JyUndMQ3EyMGdDU3hYYW1zU2ZlanlfS0llbXJqcS1XZXZFQXBWdUI2a2hPcnQ5azRvUE5ISlBxMjc1UWRNMjlSbE84SllaMFRNLVp4MjZtcDB6S0N1Y2N5WTZWU0FNQTR3?oc=5" target="_blank">Howard University Civil Engineering Research Team Uses AI to Help Address Climate Change Crises</a> <font color="#6f6f6f">The Dig at Howard University</font>
Findings from the AI Climate Hoax: What is the real climate impact of data centres? - Finextra Research
<a href="https://news.google.com/rss/articles/CBMiwwFBVV95cUxNYlEyeXg4dVpzSC1xZzdhUHRzdkJ5VkVuRF94MlZCbVVUZ3NmaEh6NUg5OHA2a3BZd3paQk85Rlo5Tm8xT1lwUUt0WHlZeU1lckw2NjZTZEpFM2NtQnVESi1FTnNzR2duYmdfTXMzMGhraEc3ZHN2a1I3cmVnZUQ3TnhZUGFLT29oNzJxRWdVOTdVM0E5NmNBZlo5RHR6em4tdmo5NmJDRjgzZVdRNUlXMDE0U2dSTy1XVE1nMmlUU0hGT1k?oc=5" target="_blank">Findings from the AI Climate Hoax: What is the real climate impact of data centres?</a> <font color="#6f6f6f">Finextra Research</font>
UTA opens AI-driven Smart Agriculture Research Center - uta.edu
<a href="https://news.google.com/rss/articles/CBMipgFBVV95cUxPUzFsREVuMVdwd0k5dGp6M2V5bW9sWkhDZlhEdENoZFQ0NHg2c2tWVWRrbW5PZ2Z3a3RFd3dleHJPZzZxMW5mZV9JUV9FYk55bVVHcXV5UzJiOTdsV2JfVWlnZE1xdVczSVh6RGQ4c2xDWkl3SS1zakVwNDZoOWNpVGRYTUVxTzREal94dk9BVnRWRzlQMi1UODJKLWkwc2RsOVdSOFZR?oc=5" target="_blank">UTA opens AI-driven Smart Agriculture Research Center</a> <font color="#6f6f6f">uta.edu</font>

Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!