Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessExclusive: Miravoice, Builder Of An AI ‘Interviewer’ To Conduct Phone Surveys, Raises $6.3MCrunchbase NewsMaul: Shadow Lord Will Return for Season 2GizmodoA jury says Meta and Google hurt a kid. What now?The Verge AIHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechCapacity and speed: why TikTok shelved its second Irish data centreSilicon RepublicDiverse teams start with diverse VCsTechCrunch AIChatGPT contractor building deradicalization chatbot after school shooter scandal - cybernews.comGoogle News: ChatGPTThis even smaller credit card-sized e-reader has one tragic flawThe VergeWhat history can teach us about AI - Johns Hopkins UniversityGNews AI USAContextCore: AI Agents conversations to an MCP-queryable memory layerDEV Community7 ways Dubai’s AI-powered government will change your daily life in the UAE - Gulf NewsGoogle News AI UAEI Built a 209-Page Sauna Site Without Knowing How to CodeDEV CommunityBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessExclusive: Miravoice, Builder Of An AI ‘Interviewer’ To Conduct Phone Surveys, Raises $6.3MCrunchbase NewsMaul: Shadow Lord Will Return for Season 2GizmodoA jury says Meta and Google hurt a kid. What now?The Verge AIHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechCapacity and speed: why TikTok shelved its second Irish data centreSilicon RepublicDiverse teams start with diverse VCsTechCrunch AIChatGPT contractor building deradicalization chatbot after school shooter scandal - cybernews.comGoogle News: ChatGPTThis even smaller credit card-sized e-reader has one tragic flawThe VergeWhat history can teach us about AI - Johns Hopkins UniversityGNews AI USAContextCore: AI Agents conversations to an MCP-queryable memory layerDEV Community7 ways Dubai’s AI-powered government will change your daily life in the UAE - Gulf NewsGoogle News AI UAEI Built a 209-Page Sauna Site Without Knowing How to CodeDEV Community
AI NEWS HUBbyEIGENVECTOREigenvector

Enhancing Automatic Chord Recognition via Pseudo-Labeling and Knowledge Distillation

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2602.19778v3 Announce Type: replace-cross Abstract: Automatic Chord Recognition (ACR) is constrained by the scarcity of aligned chord labels, as well-aligned annotations are costly to acquire. At the same time, open-weight pre-trained models are currently more accessible than their proprietary training data. In this work, we present a two-stage training pipeline that leverages pre-trained models together with unlabeled audio. The proposed method decouples training into two stages. In the first stage, we use a pre-trained BTC model as a teacher to generate pseudo-labels for over 1,000 hou — Nghia Phan, Rong Jin, Gang Liu, Xiao Dong

View PDF HTML (experimental)

Abstract:Automatic Chord Recognition (ACR) is constrained by the scarcity of aligned chord labels, as well-aligned annotations are costly to acquire. At the same time, open-weight pre-trained models are currently more accessible than their proprietary training data. In this work, we present a two-stage training pipeline that leverages pre-trained models together with unlabeled audio. The proposed method decouples training into two stages. In the first stage, we use a pre-trained BTC model as a teacher to generate pseudo-labels for over 1,000 hours of diverse unlabeled audio and train a student model solely on these pseudo-labels. In the second stage, the student is continually trained on ground-truth labels as they become available. To prevent catastrophic forgetting of the representations learned in the first stage, we apply selective knowledge distillation (KD) from the teacher as a regularizer. In our experiments, two models (BTC, 2E1D) were used as students. In stage 1, using only pseudo-labels, the BTC student achieves over 99% of the teacher's performance, while the 2E1D model achieves about 97% across seven standard mir_eval metrics. After a single training run for both students in stage 2, the resulting BTC student model surpasses the traditional supervised learning baseline by 2.5% and the original pre-trained teacher model by 1.1-3.2% across all metrics. The resulting 2E1D student model improves over the traditional supervised learning baseline by 2.67% on average and achieves almost the same performance as the teacher. Both cases show large gains on rare chord qualities.

Comments: 8 pages, 6 figures, 3 tables

Subjects:

Sound (cs.SD); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)

Cite as: arXiv:2602.19778 [cs.SD]

(or arXiv:2602.19778v3 [cs.SD] for this version)

https://doi.org/10.48550/arXiv.2602.19778

arXiv-issued DOI via DataCite

Submission history

From: Nghia Phan [view email] [v1] Mon, 23 Feb 2026 12:32:53 UTC (2,264 KB) [v2] Thu, 26 Mar 2026 17:38:09 UTC (2,267 KB) [v3] Sat, 28 Mar 2026 09:06:08 UTC (2,265 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Enhancing A…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 157 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!