MM-DADM: Multimodal Drug-Aware Diffusion Model for Virtual Clinical Trials
arXiv:2502.07297v3 Announce Type: replace Abstract: High failure rates in cardiac drug development necessitate virtual clinical trials via electrocardiogram (ECG) generation to reduce risks and costs. However, existing ECG generation models struggle to balance morphological realism with pathological flexibility, fail to disentangle demographics from genuine drug effects, and are severely bottlenecked by early-phase data scarcity. To overcome these hurdles, we propose the Multimodal Drug-Aware Diffusion Model (MM-DADM), the first generative framework for generating individualized drug-induced E — Qian Shao, Bang Du, Zepeng Li, Qiyuan Chen, Jiahe Chen, Hongxia Xu, Jimeng Sun, Jian Wu, Jintai Chen
View PDF HTML (experimental)
Abstract:High failure rates in cardiac drug development necessitate virtual clinical trials via electrocardiogram (ECG) generation to reduce risks and costs. However, existing ECG generation models struggle to balance morphological realism with pathological flexibility, fail to disentangle demographics from genuine drug effects, and are severely bottlenecked by early-phase data scarcity. To overcome these hurdles, we propose the Multimodal Drug-Aware Diffusion Model (MM-DADM), the first generative framework for generating individualized drug-induced ECGs. Specifically, our proposed MM-DADM integrates a Dynamic Cross-Attention (DCA) module that adaptively fuses External Physical Knowledge (EPK) to preserve morphological realism while avoiding the suppression of complex pathological nuances. To resolve feature entanglement, a Causal Feature Encoder (CFE) actively filters out demographic noise to extract pure pharmacological representations. These representations subsequently guide a Causal-Disentangled ControlNet (CDC-Net), which leverages counterfactual data augmentation to explicitly learn intrinsic pharmacological mechanisms despite limited clinical data. Extensive experiments on $9,443$ ECGs across $8$ drug regimens demonstrate that MM-DADM outperforms $10$ state-of-the-art ECG generation models, improving simulation accuracy by at least $6.13%$ and recall by $5.89%$, while providing highly effective data augmentation for downstream classification tasks.
Comments: Under review
Subjects:
Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
Cite as: arXiv:2502.07297 [cs.LG]
(or arXiv:2502.07297v3 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2502.07297
arXiv-issued DOI via DataCite
Submission history
From: Qian Shao [view email] [v1] Tue, 11 Feb 2025 06:50:33 UTC (2,862 KB) [v2] Sun, 18 May 2025 08:05:51 UTC (2,940 KB) [v3] Mon, 30 Mar 2026 10:51:39 UTC (2,987 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxivExclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ
<a href="https://news.google.com/rss/articles/CBMiuANBVV95cUxPTHlYUm1vbnBJRVd0a2VFOWVrNDJsMGxKZHZjUlJRc0wwLWxpNmJZVkZjcXo5dkViM0xKclVXbjFPS3BHUkZsNzVxbUgwUmJTZHlSVnkzSHQzc3BlS2toeUMzaHl6SUJjTnJ5ZHpJX3B5M3FfV3NmS1NKUVFRLWM5VVl0T2RmdjFpVnVzQkJFbG56MUFuRk1vWWhrZVR6LWRpYlNsZ0hUNWpZc1FYeGZWU2tidzc5WXdrZFFnUHBVRmZZRkFPY0ZKTVZJdnExQVhwY21yMy01QlRBUnJyWXFEd3gzOWNYSGZSd2xqcHV5aHJFcl9Mb0ZheFR6TmVzRE9NZGdvczNtRndfTmpEYXZHYlJCUkJmQ3daY2h3Zi1XcGxJaWF2bHo0WEwwSTZNMkhJeVpkN1NFQVU0dkFZbVE1bVlTT3ozay1aWVZjcndhaXBEdHAwSHlGYkRLdjlXQnNmSjUxa21iRGVEeEJmNDZGUTNxdG96OGFtUmxjVUNvamRoaGMxeGUzOEpsWGJTT0pjN1B1bkNVanlqaWd5QVVPWllVdERYVjMtaThMWlpFVUFOSWdxTWNDYw?oc=5" target="_blank">Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models</a> <font color="#6f6f6f">WSJ</font>
Perplexity launches Secure Intelligence Institute to advance AI security, privacy, and safety research - Moneycontrol.com
<a href="https://news.google.com/rss/articles/CBMi9AFBVV95cUxQMG5ITUdubnA4ankwSVo4djlMckJIM2V3cVpTSjA3ZFhVVTc0dHNCMlBJV3dNUVpsZ1lhcEM2aFpkLUs4Ym9IRXZqZFI4OVpwa3E3bnFvNS1uQk5vOVVJYnZ0TGNSQ080VlJERVNlaUt3WVE1WWJfUDlOS0JTOVc4Mk96Rmc1OEp2TmktOEVCODRqTEdpajBGb0JVQmUwNlVjdE96U3U5MERHQm1fWUFMbUhseXluVGZJaUdpQXQxT0s3SzdjdTVLampLbmI4Vnd1a0E3MC1VYVF0STBKTGNMZmhFaE9YVlF6X3dFWHJCenpsNzZW0gH6AUFVX3lxTE56aE9Dc3VlNGw4cGhFRzRHTmhuMXplZTY2QVBsYVJPWVhWclhWbWZLT2xtNllaOU1VMnRVYm9KZUV5anpUbW0wX011T01pcjFpT3hOcWFreFQ1bUVjdFoyZ0pGVGxsazNKRUxKVnAzU1dhbEV3ZmlRZmxYeFNkcmpkZXkyTEtjZXotZno3YjdWSW01RGo0R2RHWGIzMTh6VUpvUFBDbEt5ODAtYTZqOHl3ZzFMVkRLV0ZwVmp3ZWZFWW1VYVFyLTYxa3dvU2ozNEJqRENwcG45WEVJUmtZUVprTFdud3lWN1hHODRiZmtINlh4SmtSZVNhNWc?oc=5" target="_blank">Perplexity launches Secure Intelligence Institute to advance
Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ
<a href="https://news.google.com/rss/articles/CBMiuANBVV95cUxQa2VLbUtXOHVWRUhpVXVlV1hxYWNWeWJpTkR1M2ZQLWNuWjVVLW9LX2tEWmF6RzRnaXQ3dnIzam9Vd1pmaWR4VlY3Z0V6SWVRMXl6UTJaZndWanl5bHE5N2RST3FpdmFMTld1TklVWU9KNW5fVWo0bEo4OW1ta0RkaVpLSUpISWNlQlM2QmVHSnhNVVFnem5RVm85M2lLekdVekxRd3ktanNrQ2hna1l1d19Xd3JOSVRLQ3BnbFZ3Q2xJUHNxWVZ4Wlc0ekFWN2oxdFBVTm8xWXBQY3k0T1FXM3BlU0NsbHYzR0UzUjRxRmRpODVDWktIeFUzaXJweDZ6WXc2VE0wYVI5MjdicVZaSXVsdmRUeUhXTmFSZFFiRzJHdnNzNzk3aEhVVDU2dHdlMXVEeHJQUTM3d2JtT3Fjb2NwUDUwdEtIMXR2VzNUMGI2NjVHMEMxcUwzNXE3VzFnZ1R4TE92cXhxU1d5MGdMZXRvQVNWdV83SF9aZFY5QXNxVHlkZ0o3MVZrXzVtb2hPYXZ6UmZfM1o3WkV6emwzdkpRLW5yLXRVcmZaSWdaeVpHZ09zX3pSeA?oc=5" target="_blank">Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models</a> <font color="#6f6f6f">WSJ</font>
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers

Multi-Layered Memory Architectures for LLM Agents: An Experimental Evaluation of Long-Term Context Retention
arXiv:2603.29194v1 Announce Type: new Abstract: Long-horizon dialogue systems suffer from semanticdrift and unstable memory retention across extended sessions. This paper presents a Multi-Layer Memory Framework that decomposes dialogue history into working, episodic, and semantic layers with adaptive retrieval gating and retention regularization. The architecture controls cross-session drift while maintaining bounded context growth and computational efficiency. Experiments on LOCOMO, LOCCO, and LoCoMo show improved performance, achieving 46.85 Success Rate, 0.618 overall F1 with 0.594 multi-hop F1, and 56.90% six-period retention while reducing false memory rate to 5.1% and context usage to 58.40%. Results confirm enhanced long-term retention and reasoning stability under constrained conte

3D Architect: An Automated Approach to Three-Dimensional Modeling
arXiv:2603.29191v1 Announce Type: new Abstract: The aim of our paper is to render an object in 3-dimension using a set of its orthographic views. Corner detector (Harris Detector) is applied on the input views to obtain control points. These control points are projected perpendicular to respective views, in order to construct an envelope. A set of points describing the object in 3-dimension, are obtained from the intersection of these mutually perpendicular envelopes. These set of points are used to regenerate the surfaces of the object using computational geometry. At the end, the object in 3-dimension is rendered using OpenGL

SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation
arXiv:2603.29186v1 Announce Type: new Abstract: This paper proposes the synthetic long-video meta-evaluation (SLVMEval), a benchmark for meta-evaluating text-to-video (T2V) evaluation systems. The proposed SLVMEval benchmark focuses on assessing these systems on videos of up to 10,486 s (approximately 3 h). The benchmark targets a fundamental requirement, namely, whether the systems can accurately assess video quality in settings that are easy for humans to assess. We adopt a pairwise comparison-based meta-evaluation framework. Building on dense video-captioning datasets, we synthetically degrade source videos to create controlled "high-quality versus low-quality" pairs across 10 distinct aspects. Then, we employ crowdsourcing to filter and retain only those pairs in which the degradation

Developing a Guideline for the Labovian-Structural Analysis of Oral Narratives in Japanese
arXiv:2603.29347v1 Announce Type: new Abstract: Narrative analysis is a cornerstone of qualitative research. One leading approach is the Labovian model, but its application is labor-intensive, requiring a holistic, recursive interpretive process that moves back and forth between individual parts of the transcript and the transcript as a whole. Existing Labovian datasets are available only in English, which differs markedly from Japanese in terms of grammar and discourse conventions. To address this gap, we introduce the first systematic guidelines for Labovian narrative analysis of Japanese narrative data. Our guidelines retain all six Labovian categories and extend the framework by providing explicit rules for clause segmentation tailored to Japanese constructions. In addition, our guidel

Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!