MD-RWKV-UNet: Scale-Aware Anatomical Encoding with Cross-Stage Fusion for Multi-Organ Segmentation
arXiv:2603.27261v1 Announce Type: new Abstract: Multi-organ segmentation in medical imaging remains challenging due to large anatomical variability, complex inter-organ dependencies, and diverse organ scales and shapes. Conventional encoder-decoder architectures often struggle to capture both fine-grained local details and long-range context, which are crucial for accurate delineation - especially for small or deformable organs. To address these limitations, we propose MD-RWKV-UNet, a dynamic encoder network that enables scale-aware representation and spatially adaptive context modeling. At it — Zhuoyi Fang
View PDF HTML (experimental)
Abstract:Multi-organ segmentation in medical imaging remains challenging due to large anatomical variability, complex inter-organ dependencies, and diverse organ scales and shapes. Conventional encoder-decoder architectures often struggle to capture both fine-grained local details and long-range context, which are crucial for accurate delineation - especially for small or deformable organs. To address these limitations, we propose MD-RWKV-UNet, a dynamic encoder network that enables scale-aware representation and spatially adaptive context modeling. At its core is the MD-RWKV block, a dual-path module that integrates deformable spatial shifts with the Receptance Weighted Key Value mechanism, allowing the receptive field to adapt dynamically to local structural cues. We further incorporate Selective Kernel Attention to enable adaptive selection of convolutional kernels with varying receptive fields, enhancing multi-scale interaction and improving robustness to organ size and shape variation. In parallel, a cross-stage dual-attention fusion strategy aggregates multi-level features across the encoder, preserving low-level structure while enhancing semantic consistency. Unlike methods that stack static convolutions or rely heavily on global attention, our approach provides a lightweight yet expressive solution for dynamic organ modeling. Experiments on Synapse and ACDC demonstrate state-of-the-art performance, particularly in boundary precision and small-organ segmentation.
Subjects:
Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as: arXiv:2603.27261 [cs.CV]
(or arXiv:2603.27261v1 [cs.CV] for this version)
https://doi.org/10.48550/arXiv.2603.27261
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Zhuoyi Fang [view email] [v1] Sat, 28 Mar 2026 12:53:50 UTC (12,387 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxiv
Major 4-day workweek study suggests that when we work 5 days we spend one doing basically nothing
Research says workers can get as much done in a 33-hour week as in 38 hours. Essentially, those of us on a five-day week are filling up our days with time-wasting activities.

UKRI Deems Turing Institute Not Yet Satisfactory
UK Research and Innovation (UKRI) found that the Alan Turing Institute s strategic alignment and value for money are not yet satisfactory in a review of the AI research body s performance. The Turing Institute has dealt with a tumultuous year, with its head stepping down amid pushback from staff complaining about a toxic work environment. The [ ] The post UKRI Deems Turing Institute Not Yet Satisfactory appeared first on DIGIT .
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.





Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!