Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessLetters to Sen. Ed Markey: six autonomous vehicle companies say remote assistants don't directly control vehicles; Tesla says its operators are allowed to do so (Aarian Marshall/Wired)TechmemeAnthropic Just Leaked Claude Code's Source. Here's What It Means for Your Vibe-Coded App.DEV CommunityYou're a slop coder. Autospec is for professionals only.DEV CommunityWhat Happened to CodiumAI? The Rebrand to Qodo ExplainedDEV CommunityWhat Karpathy's Autoresearch Unlocked for MeDEV CommunityOpenClaw Creem agentDEV CommunityStock Market Today, March 31: Nvidia Rises on $2 Billion Marvell AI Infrastructure Partnership - The Motley FoolGNews AI NVIDIAVolt Typhoon Weaponized SOHO Routers at Scale — Here's Your Zero-Trust Playbook for the Remote EdgeDEV CommunityDeep Dive into vLLM: How PagedAttention & Continuous Batching Revolutionized LLM InferenceDEV CommunityFour futures of AI: Life sciences - EYGoogle News: AICan consumers support AI? Just 3% of households are paying subscribers - KTVLGNews AI USAOpenAI: $122 Billion Funding At $852 Billion Valuation Raised To Accelerate Next Phase Of AI - pulse2.comGoogle News: OpenAIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessLetters to Sen. Ed Markey: six autonomous vehicle companies say remote assistants don't directly control vehicles; Tesla says its operators are allowed to do so (Aarian Marshall/Wired)TechmemeAnthropic Just Leaked Claude Code's Source. Here's What It Means for Your Vibe-Coded App.DEV CommunityYou're a slop coder. Autospec is for professionals only.DEV CommunityWhat Happened to CodiumAI? The Rebrand to Qodo ExplainedDEV CommunityWhat Karpathy's Autoresearch Unlocked for MeDEV CommunityOpenClaw Creem agentDEV CommunityStock Market Today, March 31: Nvidia Rises on $2 Billion Marvell AI Infrastructure Partnership - The Motley FoolGNews AI NVIDIAVolt Typhoon Weaponized SOHO Routers at Scale — Here's Your Zero-Trust Playbook for the Remote EdgeDEV CommunityDeep Dive into vLLM: How PagedAttention & Continuous Batching Revolutionized LLM InferenceDEV CommunityFour futures of AI: Life sciences - EYGoogle News: AICan consumers support AI? Just 3% of households are paying subscribers - KTVLGNews AI USAOpenAI: $122 Billion Funding At $852 Billion Valuation Raised To Accelerate Next Phase Of AI - pulse2.comGoogle News: OpenAI

Unsupervised Evaluation of Deep Audio Embeddings for Music Structure Analysis

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.27218v1 Announce Type: cross Abstract: Music Structure Analysis (MSA) aims to uncover the high-level organization of musical pieces. State-of-the-art methods are often based on supervised deep learning, but these methods are bottlenecked by the need for heavily annotated data and inherent structural ambiguities. In this paper, we propose an unsupervised evaluation of nine open-source, generic pre-trained deep audio models, on MSA. For each model, we extract barwise embeddings and segment them using three unsupervised segmentation algorithms (Foote's checkerboard kernels, spectral cl — Axel Marmoret

View PDF HTML (experimental)

Abstract:Music Structure Analysis (MSA) aims to uncover the high-level organization of musical pieces. State-of-the-art methods are often based on supervised deep learning, but these methods are bottlenecked by the need for heavily annotated data and inherent structural ambiguities. In this paper, we propose an unsupervised evaluation of nine open-source, generic pre-trained deep audio models, on MSA. For each model, we extract barwise embeddings and segment them using three unsupervised segmentation algorithms (Foote's checkerboard kernels, spectral clustering, and Correlation Block-Matching (CBM)), focusing exclusively on boundary retrieval. Our results demonstrate that modern, generic deep embeddings generally outperform traditional spectrogram-based baselines, but not systematically. Furthermore, our unsupervised boundary estimation methodology generally yields stronger performance than recent linear probing baselines. Among the evaluated techniques, the CBM algorithm consistently emerges as the most effective downstream segmentation method. Finally, we highlight the artificial inflation of standard evaluation metrics and advocate for the systematic adoption of trimming'', or even double trimming'' annotations to establish more rigorous MSA evaluation standards.

Comments: Submitted to the SMC 2026 conference. 2 figures and 2 tables in the main document, 7 figures in Appendix

Subjects:

Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)

ACM classes: H.5.5

Cite as: arXiv:2603.27218 [cs.SD]

(or arXiv:2603.27218v1 [cs.SD] for this version)

https://doi.org/10.48550/arXiv.2603.27218

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Axel Marmoret [view email] [v1] Sat, 28 Mar 2026 10:18:54 UTC (200 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Unsupervise…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 125 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers