Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechArtemis II: Why our return to the moon took so longFast Company TechPost Quantum Cryptography - ComputerphileComputerphile YTPickNik Robotics gives MoveIt Pro 9.0 enhanced perception-to-motion, teleop capabilitiesRobotics Business ReviewScientists Build Living Robots With Nervous SystemsIEEE RoboticsHow AI agents are changing journalismFast Company TechWhy AI health chatbots won’t make you better at diagnosing yourself – new research - Gavi, the Vaccine AllianceGoogle News: AIRun OpenCode in Docker - Clean machine, same convenienceDEV CommunityGood UI Is Just Invisible EngineeringDEV CommunityFace Tracking for Vertical Video: Why It's Harder Than It Looks (And How It Works)DEV CommunityI Built a Privacy-First Developer Toolbox That Runs 100% in Your BrowserDEV CommunityI Published 3 Products on Gumroad. 0 Sales. Here's My Honest Postmortem.DEV CommunityBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechArtemis II: Why our return to the moon took so longFast Company TechPost Quantum Cryptography - ComputerphileComputerphile YTPickNik Robotics gives MoveIt Pro 9.0 enhanced perception-to-motion, teleop capabilitiesRobotics Business ReviewScientists Build Living Robots With Nervous SystemsIEEE RoboticsHow AI agents are changing journalismFast Company TechWhy AI health chatbots won’t make you better at diagnosing yourself – new research - Gavi, the Vaccine AllianceGoogle News: AIRun OpenCode in Docker - Clean machine, same convenienceDEV CommunityGood UI Is Just Invisible EngineeringDEV CommunityFace Tracking for Vertical Video: Why It's Harder Than It Looks (And How It Works)DEV CommunityI Built a Privacy-First Developer Toolbox That Runs 100% in Your BrowserDEV CommunityI Published 3 Products on Gumroad. 0 Sales. Here's My Honest Postmortem.DEV Community
AI NEWS HUBbyEIGENVECTOREigenvector

Local Precise Refinement: A Dual-Gated Mixture-of-Experts for Enhancing Foundation Model Generalization against Spectral Shifts

arXivMarch 30, 202610 min read0 views
Source Quiz

arXiv:2603.13352v3 Announce Type: replace Abstract: Domain Generalization Semantic Segmentation (DGSS) in spectral remote sensing is severely challenged by spectral shifts across diverse acquisition conditions, which cause significant performance degradation for models deployed in unseen domains. While fine-tuning foundation models is a promising direction, existing methods employ global, homogeneous adjustments. This "one-size-fits-all" tuning struggles with the spatial heterogeneity of land cover, causing semantic confusion. We argue that the key to robust DGSS lies not in a single global ad — Xi Chen, Maojun Zhang, Yu Liu, Shen Yan

View PDF HTML (experimental)

Abstract:Domain Generalization Semantic Segmentation (DGSS) in spectral remote sensing is severely challenged by spectral shifts across diverse acquisition conditions, which cause significant performance degradation for models deployed in unseen domains. While fine-tuning foundation models is a promising direction, existing methods employ global, homogeneous adjustments. This "one-size-fits-all" tuning struggles with the spatial heterogeneity of land cover, causing semantic confusion. We argue that the key to robust DGSS lies not in a single global adaptation, but in performing fine-grained, spatially-adaptive refinement of a foundation model's features. To achieve this, we propose SpectralMoE, a novel fine-tuning framework for DGSS. It operationalizes this principle by utilizing a Mixture-of-Experts (MoE) architecture to perform \textbf{local precise refinement} on the foundation model's features, incorporating depth features estimated from selected RGB bands of the spectral remote sensing imagery to guide the fine-tuning process. Specifically, SpectralMoE employs a dual-gated MoE architecture that independently routes visual and depth features to top-k selected experts for specialized refinement, enabling modality-specific adjustments. A subsequent cross-attention mechanism then judiciously fuses the refined structural cues into the visual stream, mitigating semantic ambiguities caused by spectral variations. Extensive experiments show that SpectralMoE sets a new state-of-the-art on multiple DGSS benchmarks across hyperspectral, multispectral, and RGB remote sensing imagery.

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.13352 [cs.CV]

(or arXiv:2603.13352v3 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.13352

arXiv-issued DOI via DataCite

Submission history

From: Silas Chen [view email] [v1] Sun, 8 Mar 2026 04:53:05 UTC (13,898 KB) [v2] Tue, 24 Mar 2026 10:11:10 UTC (2,983 KB) [v3] Fri, 27 Mar 2026 10:15:57 UTC (2,982 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Local Preci…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 205 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers