Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessLLM Context Windows: Managing Tokens in Production AI AppsDEV CommunityPgBouncer: Database Connection Pooling That Actually ScalesDEV CommunityHow to Choose The Best Test Management Software For Your TeamDEV CommunityWhy I Built Scenar.io - An AI-Powered DevOps Interview Practice ToolDEV CommunityOAuth 2.0 Flows Demystified: Authorization Code, PKCE, and Client CredentialsDEV CommunityAI Doesn't Fix Your Development Problems. It Accelerates Them.DEV CommunityWhat Gemma 4's multi-token prediction head actually means for your eval pipelineDEV CommunityThe 3-File Context Kit: Everything Your AI Needs to Understand Your ProjectDEV CommunityMicroservices Communication: REST, gRPC, and Message QueuesDEV Community10 LLM Engineering Concepts Explained in 10 Minutes - KDnuggetsGNews AI RAGSamsung forecasts record Q1 2026 profit, up eightfold, on AI chip demand - qz.comGNews AI SamsungWHY use OBIX?DEV CommunityBlack Hat USADark ReadingBlack Hat AsiaAI BusinessLLM Context Windows: Managing Tokens in Production AI AppsDEV CommunityPgBouncer: Database Connection Pooling That Actually ScalesDEV CommunityHow to Choose The Best Test Management Software For Your TeamDEV CommunityWhy I Built Scenar.io - An AI-Powered DevOps Interview Practice ToolDEV CommunityOAuth 2.0 Flows Demystified: Authorization Code, PKCE, and Client CredentialsDEV CommunityAI Doesn't Fix Your Development Problems. It Accelerates Them.DEV CommunityWhat Gemma 4's multi-token prediction head actually means for your eval pipelineDEV CommunityThe 3-File Context Kit: Everything Your AI Needs to Understand Your ProjectDEV CommunityMicroservices Communication: REST, gRPC, and Message QueuesDEV Community10 LLM Engineering Concepts Explained in 10 Minutes - KDnuggetsGNews AI RAGSamsung forecasts record Q1 2026 profit, up eightfold, on AI chip demand - qz.comGNews AI SamsungWHY use OBIX?DEV Community
AI NEWS HUBbyEIGENVECTOREigenvector

MeDUET: Disentangled Unified Pretraining for 3D Medical Image Synthesis and Analysis

arXiv eess.IVby Junkai Liu, Ling Shao, Le ZhangApril 7, 20262 min read0 views
Source Quiz

arXiv:2602.17901v2 Announce Type: replace Abstract: Self-supervised learning (SSL) and diffusion models have advanced representation learning and image synthesis, but in 3D medical imaging they are still largely used separately for analysis and synthesis, respectively. Unifying them is appealing but difficult, because multi-source data exhibit pronounced style shifts while downstream tasks rely primarily on anatomy, causing anatomical content and acquisition style to become entangled. In this paper, we propose MeDUET, a 3D Medical image Disentangled UnifiEd PreTraining framework in the variational autoencoder latent space. Our central idea is to treat unified pretraining under heterogeneous multi-center data as a factor identifiability problem, where content should consistently capture ana

View PDF HTML (experimental)

Abstract:Self-supervised learning (SSL) and diffusion models have advanced representation learning and image synthesis, but in 3D medical imaging they are still largely used separately for analysis and synthesis, respectively. Unifying them is appealing but difficult, because multi-source data exhibit pronounced style shifts while downstream tasks rely primarily on anatomy, causing anatomical content and acquisition style to become entangled. In this paper, we propose MeDUET, a 3D Medical image Disentangled UnifiEd PreTraining framework in the variational autoencoder latent space. Our central idea is to treat unified pretraining under heterogeneous multi-center data as a factor identifiability problem, where content should consistently capture anatomy and style should consistently capture appearance. MeDUET addresses this problem through three components. Token demixing provides controllable supervision for factor separation, Mixed Factor Token Distillation reduces factor leakage under mixed regions, and Swap-invariance Quadruplet Contrast promotes factor-wise invariance and discriminability. With these learned factors, MeDUET transfers effectively to both synthesis and analysis, yielding higher fidelity, faster convergence, and better controllability for synthesis, while achieving competitive or superior domain generalization and label efficiency on diverse medical benchmarks. Overall, MeDUET shows that multi-source heterogeneity can serve as useful supervision, with disentanglement providing an effective interface for unifying 3D medical image synthesis and analysis. Our code is available at this https URL.

Subjects:

Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Computer Science and Game Theory (cs.GT)

Cite as: arXiv:2602.17901 [eess.IV]

(or arXiv:2602.17901v2 [eess.IV] for this version)

https://doi.org/10.48550/arXiv.2602.17901

arXiv-issued DOI via DataCite

Submission history

From: Junkai Liu [view email] [v1] Thu, 19 Feb 2026 23:45:23 UTC (3,515 KB) [v2] Sun, 5 Apr 2026 15:40:36 UTC (5,032 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelbenchmarktraining

Knowledge Map

Knowledge Map
TopicsEntitiesSource
MeDUET: Dis…modelbenchmarktrainingannounceavailableacquisitionarXiv eess.…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 237 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models