Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessWhy Developer Productivity Engineering is UnderratedDEV CommunityMatrices in PythonDEV CommunityUse OpenClaw to Make a Personal AI AssistantTowards AIQodo vs Sourcery: AI Code Review Approaches Compared (2026)DEV CommunityCreating a 50 GB Swap File on Jetson AGX Orin (Root on NVMe)DEV CommunityFrom Redis to Valkey: pre-migration Reconnaissance — detect all apps & connections in realtimeDEV CommunityMuri: The Root Cause of OverburdenDEV CommunityStop Guessing What Caused Your Flaky Tests Fail or PassDEV CommunityMura: The Source of Uneven FlowDEV Community🚀 The Developer Who Survives 2026 Is NOT the One You ThinkDEV CommunityThe UK government reportedly wants Anthropic to expand its presence in LondonEngadget"Open the Fuckin' Strait": Trump threatens to start bombing civilian infrastructure TuesdayAxios TechBlack Hat USADark ReadingBlack Hat AsiaAI BusinessWhy Developer Productivity Engineering is UnderratedDEV CommunityMatrices in PythonDEV CommunityUse OpenClaw to Make a Personal AI AssistantTowards AIQodo vs Sourcery: AI Code Review Approaches Compared (2026)DEV CommunityCreating a 50 GB Swap File on Jetson AGX Orin (Root on NVMe)DEV CommunityFrom Redis to Valkey: pre-migration Reconnaissance — detect all apps & connections in realtimeDEV CommunityMuri: The Root Cause of OverburdenDEV CommunityStop Guessing What Caused Your Flaky Tests Fail or PassDEV CommunityMura: The Source of Uneven FlowDEV Community🚀 The Developer Who Survives 2026 Is NOT the One You ThinkDEV CommunityThe UK government reportedly wants Anthropic to expand its presence in LondonEngadget"Open the Fuckin' Strait": Trump threatens to start bombing civilian infrastructure TuesdayAxios Tech
AI NEWS HUBbyEIGENVECTOREigenvector

Dual-branch Graph Domain Adaptation for Cross-scenario Multi-modal Emotion Recognition

arXivby [Submitted on 27 Mar 2026]March 31, 20262 min read1 views
Source Quiz

arXiv:2603.26840v1 Announce Type: cross Abstract: Multimodal Emotion Recognition in Conversations (MERC) aims to predict speakers' emotional states in multi-turn dialogues through text, audio, and visual cues. In real-world settings, conversation scenarios differ significantly in speakers, topics, styles, and noise levels. Existing MERC methods generally neglect these cross-scenario variations, limiting their ability to transfer models trained on a source domain to unseen target domains. To address this issue, we propose a Dual-branch Graph Domain Adaptation framework (DGDA) for multimodal emo — Yuntao Shou, Jun Zhou, Tao Meng, Wei Ai, Keqin Li

View PDF HTML (experimental)

Abstract:Multimodal Emotion Recognition in Conversations (MERC) aims to predict speakers' emotional states in multi-turn dialogues through text, audio, and visual cues. In real-world settings, conversation scenarios differ significantly in speakers, topics, styles, and noise levels. Existing MERC methods generally neglect these cross-scenario variations, limiting their ability to transfer models trained on a source domain to unseen target domains. To address this issue, we propose a Dual-branch Graph Domain Adaptation framework (DGDA) for multimodal emotion recognition under cross-scenario conditions. We first construct an emotion interaction graph to characterize complex emotional dependencies among utterances. A dual-branch encoder, consisting of a hypergraph neural network (HGNN) and a path neural network (PathNN), is then designed to explicitly model multivariate relationships and implicitly capture global dependencies. To enable out-of-domain generalization, a domain adversarial discriminator is introduced to learn invariant representations across domains. Furthermore, a regularization loss is incorporated to suppress the negative influence of noisy labels. To the best of our knowledge, DGDA is the first MERC framework that jointly addresses domain shift and label noise. Theoretical analysis provides tighter generalization bounds, and extensive experiments on IEMOCAP and MELD demonstrate that DGDA consistently outperforms strong baselines and better adapts to cross-scenario conversations. Our code is available at this https URL.

Comments: 29 pages

Subjects:

Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.26840 [eess.AS]

(or arXiv:2603.26840v1 [eess.AS] for this version)

https://doi.org/10.48550/arXiv.2603.26840

arXiv-issued DOI via DataCite

Submission history

From: Yuntao Shou [view email] [v1] Fri, 27 Mar 2026 08:21:09 UTC (9,223 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Dual-branch…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 139 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!