Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessExclusive: Miravoice, Builder Of An AI ‘Interviewer’ To Conduct Phone Surveys, Raises $6.3MCrunchbase NewsMaul: Shadow Lord Will Return for Season 2GizmodoA jury says Meta and Google hurt a kid. What now?The Verge AIHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechCapacity and speed: why TikTok shelved its second Irish data centreSilicon RepublicDiverse teams start with diverse VCsTechCrunch AIThis even smaller credit card-sized e-reader has one tragic flawThe VergeWhat history can teach us about AI - Johns Hopkins UniversityGNews AI USAContextCore: AI Agents conversations to an MCP-queryable memory layerDEV Community7 ways Dubai’s AI-powered government will change your daily life in the UAE - Gulf NewsGoogle News AI UAEI Built a 209-Page Sauna Site Without Knowing How to CodeDEV CommunityGoogle Home’s latest update makes Gemini better at understanding your commandsThe VergeBlack Hat USADark ReadingBlack Hat AsiaAI BusinessExclusive: Miravoice, Builder Of An AI ‘Interviewer’ To Conduct Phone Surveys, Raises $6.3MCrunchbase NewsMaul: Shadow Lord Will Return for Season 2GizmodoA jury says Meta and Google hurt a kid. What now?The Verge AIHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechCapacity and speed: why TikTok shelved its second Irish data centreSilicon RepublicDiverse teams start with diverse VCsTechCrunch AIThis even smaller credit card-sized e-reader has one tragic flawThe VergeWhat history can teach us about AI - Johns Hopkins UniversityGNews AI USAContextCore: AI Agents conversations to an MCP-queryable memory layerDEV Community7 ways Dubai’s AI-powered government will change your daily life in the UAE - Gulf NewsGoogle News AI UAEI Built a 209-Page Sauna Site Without Knowing How to CodeDEV CommunityGoogle Home’s latest update makes Gemini better at understanding your commandsThe Verge
AI NEWS HUBbyEIGENVECTOREigenvector

Constructing Composite Features for Interpretable Music-Tagging

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.28644v1 Announce Type: cross Abstract: Combining multiple audio features can improve the performance of music tagging, but common deep learning-based feature fusion methods often lack interpretability. To address this problem, we propose a Genetic Programming (GP) pipeline that automatically evolves composite features by mathematically combining base music features, thereby capturing synergistic interactions while preserving interpretability. This approach provides representational benefits similar to deep feature fusion without sacrificing interpretability. Experiments on the MTG-J — Chenhao Xue, Weitao Hu, Joyraj Chakraborty, Zhijin Guo, Kang Li, Tianyu Shi, Martin Reed, Nikolaos Thomos

View PDF HTML (experimental)

Abstract:Combining multiple audio features can improve the performance of music tagging, but common deep learning-based feature fusion methods often lack interpretability. To address this problem, we propose a Genetic Programming (GP) pipeline that automatically evolves composite features by mathematically combining base music features, thereby capturing synergistic interactions while preserving interpretability. This approach provides representational benefits similar to deep feature fusion without sacrificing interpretability. Experiments on the MTG-Jamendo and GTZAN datasets demonstrate consistent improvements compared to state-of-the-art systems across base feature sets at different abstraction levels. It should be noted that most of the performance gains are noticed within the first few hundred GP evaluations, indicating that effective feature combinations can be identified under modest search budgets. The top evolved expressions include linear, nonlinear, and conditional forms, with various low-complexity solutions at top performance aligned with parsimony pressure to prefer simpler expressions. Analyzing these composite features further reveals which interactions and transformations tend to be beneficial for tagging, offering insights that remain opaque in black-box deep models.

Comments: 5 pages, 8 figures, accepted at ICASSP 2026

Subjects:

Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM)

Cite as: arXiv:2603.28644 [cs.SD]

(or arXiv:2603.28644v1 [cs.SD] for this version)

https://doi.org/10.48550/arXiv.2603.28644

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Chenhao Xue [view email] [v1] Mon, 30 Mar 2026 16:25:58 UTC (4,896 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Constructin…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 169 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!