Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessGamingtak Sony koopt start-up die foto s en video s omzet naar 3dTweakers.net跳出幸存者偏差,从结构性资源分配解析财富真相Dev.to AIJapan's Sakura Internet jumps 20% as Microsoft plans $10 billion AI push with SoftBank - CNBCGNews AI JapanJapan s Sakura Internet jumps 20% as Microsoft plans $10 billion AI push with SoftBankCNBC TechnologyMicrosoft plans $10 billion investment in Japan to grow AI, train 1 million workers by 2030 - livemint.comGNews AI JapanOpenClaw vs Cloud AI: Which One Actually Gives Businesses More Control?Medium AI“In a World of AI Content, Being Human Is Your Superpower”Medium AIHow AI is Transforming the Role of a CFO in 2026.Medium AIHow to Build Self-Running AI Tasks with TypeScript (No Cron Jobs Needed)Dev.to AIMicrosoft to invest $10 billion in Japan to expand AI infrastructure and cybersecurity partnerships - Storyboard18GNews AI JapanFaked Fire Drill!Medium AIMicrosoft to invest $10 bn for Japan AI data centres - France 24GNews AI JapanBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessGamingtak Sony koopt start-up die foto s en video s omzet naar 3dTweakers.net跳出幸存者偏差,从结构性资源分配解析财富真相Dev.to AIJapan's Sakura Internet jumps 20% as Microsoft plans $10 billion AI push with SoftBank - CNBCGNews AI JapanJapan s Sakura Internet jumps 20% as Microsoft plans $10 billion AI push with SoftBankCNBC TechnologyMicrosoft plans $10 billion investment in Japan to grow AI, train 1 million workers by 2030 - livemint.comGNews AI JapanOpenClaw vs Cloud AI: Which One Actually Gives Businesses More Control?Medium AI“In a World of AI Content, Being Human Is Your Superpower”Medium AIHow AI is Transforming the Role of a CFO in 2026.Medium AIHow to Build Self-Running AI Tasks with TypeScript (No Cron Jobs Needed)Dev.to AIMicrosoft to invest $10 billion in Japan to expand AI infrastructure and cybersecurity partnerships - Storyboard18GNews AI JapanFaked Fire Drill!Medium AIMicrosoft to invest $10 bn for Japan AI data centres - France 24GNews AI Japan
AI NEWS HUBbyEIGENVECTOREigenvector

Reverberation-Robust Localization of Speakers Using Distinct Speech Onsets and Multi-channel Cross-Correlations

arXiv eess.ASby Shoufeng LinApril 3, 20261 min read0 views
Source Quiz

arXiv:2604.01524v1 Announce Type: new Abstract: Many speaker localization methods can be found in the literature. However, speaker localization under strong reverberation still remains a major challenge in the real-world applications. This paper proposes two algorithms for localizing speakers using microphone array recordings of reverberated sounds. To separate concurrent speakers, the first algorithm decomposes microphone signals spectrotemporally into subbands via an auditory filterbank. To suppress reverberation, we propose a novel speech onset detection approach derived from the speech signal and impulse response models, and further propose to formulate the multi-channel cross-correlation coefficient (MCCC) of encoded speech onsets in each subband. The subband results are combined to e

View PDF HTML (experimental)

Abstract:Many speaker localization methods can be found in the literature. However, speaker localization under strong reverberation still remains a major challenge in the real-world applications. This paper proposes two algorithms for localizing speakers using microphone array recordings of reverberated sounds. To separate concurrent speakers, the first algorithm decomposes microphone signals spectrotemporally into subbands via an auditory filterbank. To suppress reverberation, we propose a novel speech onset detection approach derived from the speech signal and impulse response models, and further propose to formulate the multi-channel cross-correlation coefficient (MCCC) of encoded speech onsets in each subband. The subband results are combined to estimate the directions-of-arrival (DOAs) of speakers. The second algorithm extends the generalized cross-correlation - phase transform (GCC-PHAT) method by using redundant information of multiple microphones to address the reverberation problem. The proposed methods have been evaluated under adverse conditions using not only simulated signals (reverberation time $T_{60}$ of up to $1$s) but also recordings in a real reverberant room ($T_{60} \approx 0.65$s). Comparing with some state-of-the-art localization methods, experimental results confirm that the proposed methods can reliably locate static and moving speakers, in presence of reverberation.

Subjects:

Audio and Speech Processing (eess.AS)

Cite as: arXiv:2604.01524 [eess.AS]

(or arXiv:2604.01524v1 [eess.AS] for this version)

https://doi.org/10.48550/arXiv.2604.01524

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Shoufeng Lin [view email] [v1] Thu, 2 Apr 2026 01:52:10 UTC (969 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelannounceapplication

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Reverberati…modelannounceapplicationpaperarxivarXiv eess.…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 199 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Products