Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessLetters to Sen. Ed Markey: six autonomous vehicle companies say remote assistants don't directly control vehicles; Tesla says its operators are allowed to do so (Aarian Marshall/Wired)TechmemeAnthropic Just Leaked Claude Code's Source. Here's What It Means for Your Vibe-Coded App.DEV CommunityYou're a slop coder. Autospec is for professionals only.DEV CommunityWhat Happened to CodiumAI? The Rebrand to Qodo ExplainedDEV CommunityWhat Karpathy's Autoresearch Unlocked for MeDEV CommunityOpenClaw Creem agentDEV CommunityStock Market Today, March 31: Nvidia Rises on $2 Billion Marvell AI Infrastructure Partnership - The Motley FoolGNews AI NVIDIAVolt Typhoon Weaponized SOHO Routers at Scale — Here's Your Zero-Trust Playbook for the Remote EdgeDEV CommunityDeep Dive into vLLM: How PagedAttention & Continuous Batching Revolutionized LLM InferenceDEV CommunityFour futures of AI: Life sciences - EYGoogle News: AICan consumers support AI? Just 3% of households are paying subscribers - KTVLGNews AI USAOpenAI: $122 Billion Funding At $852 Billion Valuation Raised To Accelerate Next Phase Of AI - pulse2.comGoogle News: OpenAIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessLetters to Sen. Ed Markey: six autonomous vehicle companies say remote assistants don't directly control vehicles; Tesla says its operators are allowed to do so (Aarian Marshall/Wired)TechmemeAnthropic Just Leaked Claude Code's Source. Here's What It Means for Your Vibe-Coded App.DEV CommunityYou're a slop coder. Autospec is for professionals only.DEV CommunityWhat Happened to CodiumAI? The Rebrand to Qodo ExplainedDEV CommunityWhat Karpathy's Autoresearch Unlocked for MeDEV CommunityOpenClaw Creem agentDEV CommunityStock Market Today, March 31: Nvidia Rises on $2 Billion Marvell AI Infrastructure Partnership - The Motley FoolGNews AI NVIDIAVolt Typhoon Weaponized SOHO Routers at Scale — Here's Your Zero-Trust Playbook for the Remote EdgeDEV CommunityDeep Dive into vLLM: How PagedAttention & Continuous Batching Revolutionized LLM InferenceDEV CommunityFour futures of AI: Life sciences - EYGoogle News: AICan consumers support AI? Just 3% of households are paying subscribers - KTVLGNews AI USAOpenAI: $122 Billion Funding At $852 Billion Valuation Raised To Accelerate Next Phase Of AI - pulse2.comGoogle News: OpenAI

Membership Inference Attacks against Large Audio Language Models

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.28378v1 Announce Type: cross Abstract: We present the first systematic Membership Inference Attack (MIA) evaluation of Large Audio Language Models (LALMs). As audio encodes non-semantic information, it induces severe train and test distribution shifts and can lead to spurious MIA performance. Using a multi-modal blind baseline based on textual, spectral, and prosodic features, we demonstrate that common speech datasets exhibit near-perfect train/test separability (AUC approximately 1.0) even without model inference, and the standard MIA scores strongly correlate with these blind aco — Jia-Kai Dong, Yu-Xiang Lin, Hung-Yi Lee

View PDF HTML (experimental)

Abstract:We present the first systematic Membership Inference Attack (MIA) evaluation of Large Audio Language Models (LALMs). As audio encodes non-semantic information, it induces severe train and test distribution shifts and can lead to spurious MIA performance. Using a multi-modal blind baseline based on textual, spectral, and prosodic features, we demonstrate that common speech datasets exhibit near-perfect train/test separability (AUC approximately 1.0) even without model inference, and the standard MIA scores strongly correlate with these blind acoustic artifacts (correlation greater than 0.7). Using this blind baseline, we identify that distribution-matched datasets enable reliable MIA evaluation without distribution shift confounds. We benchmark multiple MIA methods and conduct modality disentanglement experiments on these datasets. The results reveal that LALM memorization is cross-modal, arising only from binding a speaker's vocal identity with its text. These findings establish a principled standard for auditing LALMs beyond spurious correlations.

Comments: submitted to Interspeech 2026

Subjects:

Sound (cs.SD); Artificial Intelligence (cs.AI)

ACM classes: I.2.6; I.5.2; K.4.1

Cite as: arXiv:2603.28378 [cs.SD]

(or arXiv:2603.28378v1 [cs.SD] for this version)

https://doi.org/10.48550/arXiv.2603.28378

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Jia-Kai Dong [view email] [v1] Mon, 30 Mar 2026 12:45:28 UTC (325 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Membership …researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 139 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers