Research Papers research paper arxiv ai artificial-intelligence

Dual-branch Graph Domain Adaptation for Cross-scenario Multi-modal Emotion Recognition

arXivby [Submitted on 27 Mar 2026]March 31, 20262 min read1 views

arXiv:2603.26840v1 Announce Type: cross Abstract: Multimodal Emotion Recognition in Conversations (MERC) aims to predict speakers' emotional states in multi-turn dialogues through text, audio, and visual cues. In real-world settings, conversation scenarios differ significantly in speakers, topics, styles, and noise levels. Existing MERC methods generally neglect these cross-scenario variations, limiting their ability to transfer models trained on a source domain to unseen target domains. To address this issue, we propose a Dual-branch Graph Domain Adaptation framework (DGDA) for multimodal emo — Yuntao Shou, Jun Zhou, Tao Meng, Wei Ai, Keqin Li

View PDF HTML (experimental)

Abstract:Multimodal Emotion Recognition in Conversations (MERC) aims to predict speakers' emotional states in multi-turn dialogues through text, audio, and visual cues. In real-world settings, conversation scenarios differ significantly in speakers, topics, styles, and noise levels. Existing MERC methods generally neglect these cross-scenario variations, limiting their ability to transfer models trained on a source domain to unseen target domains. To address this issue, we propose a Dual-branch Graph Domain Adaptation framework (DGDA) for multimodal emotion recognition under cross-scenario conditions. We first construct an emotion interaction graph to characterize complex emotional dependencies among utterances. A dual-branch encoder, consisting of a hypergraph neural network (HGNN) and a path neural network (PathNN), is then designed to explicitly model multivariate relationships and implicitly capture global dependencies. To enable out-of-domain generalization, a domain adversarial discriminator is introduced to learn invariant representations across domains. Furthermore, a regularization loss is incorporated to suppress the negative influence of noisy labels. To the best of our knowledge, DGDA is the first MERC framework that jointly addresses domain shift and label noise. Theoretical analysis provides tighter generalization bounds, and extensive experiments on IEMOCAP and MELD demonstrate that DGDA consistently outperforms strong baselines and better adapts to cross-scenario conversations. Our code is available at this https URL.

Comments: 29 pages

Subjects:

Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.26840 [eess.AS]

(or arXiv:2603.26840v1 [eess.AS] for this version)

https://doi.org/10.48550/arXiv.2603.26840

arXiv-issued DOI via DataCite

Submission history

From: Yuntao Shou [view email] [v1] Fri, 27 Mar 2026 08:21:09 UTC (9,223 KB)

Original source

arXiv

https://arxiv.org/abs/2603.26840

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Products

Scaling Synthetic Task Generation for Agents via Exploration - Apple Machine Learning Research

Scaling Synthetic Task Generation for Agents via Exploration Apple Machine Learning Research

Google News: Machine Learning

1m12 days ago

Research Papers

This Ancient Roman Game Board Was a Mystery. Researchers Used A.I. to Figure Out How to Play - Smithsonian Magazine

This Ancient Roman Game Board Was a Mystery. Researchers Used A.I. to Figure Out How to Play Smithsonian Magazine

GNews AI Netherlands

1mabout 1 month ago

ProductsFresh

How to Build an AI Content Playbook That Actually Protects Your Voice

Ahnii! You've read the articles warning you not to let AI take over your content. Ruth Doherty's latest piece is one of the best: a clear-eyed breakdown of where AI helps and where it silently destroys your brand. This post shows you how to take that framework and turn it into an actual operating document for your content pipeline. Why a Framework Without a Playbook Doesn't Stick Ruth's core argument is sharp: AI is an efficiency engine, not a strategy engine. Use it for research, structuring, repurposing, and editing. Keep it away from messaging, customer research, and anything that requires your actual point of view. That distinction is easy to agree with. It's harder to enforce on a Tuesday afternoon when you're behind on three social posts and the AI can draft all of them in 90 seconds

Dev.to AI

6mabout 2 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 139 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

Dual-branch Graph Domain Adaptation for Cross-scenario Multi-modal Emotion Recognition

Submission history

Daily AI Digest

More about

Scaling Synthetic Task Generation for Agents via Exploration - Apple Machine Learning Research

This Ancient Roman Game Board Was a Mystery. Researchers Used A.I. to Figure Out How to Play - Smithsonian Magazine

How to Build an AI Content Playbook That Actually Protects Your Voice

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Research Papers

This Ancient Roman Game Board Was a Mystery. Researchers Used A.I. to Figure Out How to Play - Smithsonian Magazine

URI Day Highlights Student Research and the Future of AI Education in Rhode Island - uri.edu

AI could transform patient education in eye care, new research shows - Medical Xpress

🥇Top AI Papers of the Week