Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessAI agents are now playing Mafia (social deduction with humans)Hacker News AI TopLet's be Honest about AI CodingHacker News AI Toptrunk/bc68fe94fe043b4c8484129d229012735df224e1PyTorch ReleasesHow to Build Production-Ready Agentic Systems with Z.AI GLM-5 Using Thinking Mode, Tool Calling, Streaming, and Multi-Turn WorkflowsMarkTechPostBillion dollar AI company was built on lies [video]Hacker News AI Toptrunk/08b65b957401b4df41e7d458d953f237e06eae9a: Remove stale Python comments (#179106)PyTorch ReleasesComparing Today's Multi-Model DatabasesDEV CommunityBuilding a WeChat Mini Program Pre-Sale System from Scratch: A Builder's LogDEV CommunityOpenAI sees a new round of executive shake-upsBusiness Insider26 Quizzes: What We've Learned About Which Results People Actually ShareDEV CommunityLayered Agentic Retrieval for Retail Floor Questions: A Solo PoCDEV CommunityHow to Handle Sensitive Data Securely in TerraformDEV CommunityBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessAI agents are now playing Mafia (social deduction with humans)Hacker News AI TopLet's be Honest about AI CodingHacker News AI Toptrunk/bc68fe94fe043b4c8484129d229012735df224e1PyTorch ReleasesHow to Build Production-Ready Agentic Systems with Z.AI GLM-5 Using Thinking Mode, Tool Calling, Streaming, and Multi-Turn WorkflowsMarkTechPostBillion dollar AI company was built on lies [video]Hacker News AI Toptrunk/08b65b957401b4df41e7d458d953f237e06eae9a: Remove stale Python comments (#179106)PyTorch ReleasesComparing Today's Multi-Model DatabasesDEV CommunityBuilding a WeChat Mini Program Pre-Sale System from Scratch: A Builder's LogDEV CommunityOpenAI sees a new round of executive shake-upsBusiness Insider26 Quizzes: What We've Learned About Which Results People Actually ShareDEV CommunityLayered Agentic Retrieval for Retail Floor Questions: A Solo PoCDEV CommunityHow to Handle Sensitive Data Securely in TerraformDEV Community
AI NEWS HUBbyEIGENVECTOREigenvector

Can LLM Agents Identify Spoken Dialects like a Linguist?

arXiv cs.CLby Tobias Bystrich, Lukas Hamm, Maria Hassan, Lea Fischbach, Lucie Flek, Akbar KarimiApril 1, 20261 min read0 views
Source Quiz

arXiv:2603.29541v1 Announce Type: new Abstract: Due to the scarcity of labeled dialectal speech, audio dialect classification is a challenging task for most languages, including Swiss German. In this work, we explore the ability of large language models (LLMs) as agents in understanding the dialects and whether they can show comparable performance to models such as HuBERT in dialect classification. In addition, we provide an LLM baseline and a human linguist one. Our approach uses phonetic transcriptions produced by ASR systems and combines them with linguistic resources such as dialect feature maps, vowel history, and rules. Our findings indicate that, when linguistic information is provided, the LLM predictions improve. The human baseline shows that automatically generated transcriptions

View PDF HTML (experimental)

Abstract:Due to the scarcity of labeled dialectal speech, audio dialect classification is a challenging task for most languages, including Swiss German. In this work, we explore the ability of large language models (LLMs) as agents in understanding the dialects and whether they can show comparable performance to models such as HuBERT in dialect classification. In addition, we provide an LLM baseline and a human linguist one. Our approach uses phonetic transcriptions produced by ASR systems and combines them with linguistic resources such as dialect feature maps, vowel history, and rules. Our findings indicate that, when linguistic information is provided, the LLM predictions improve. The human baseline shows that automatically generated transcriptions can be beneficial for such classifications, but also presents opportunities for improvement.

Comments: Accepted to DialRes Workshop @ LREC 2026

Subjects:

Computation and Language (cs.CL)

Cite as: arXiv:2603.29541 [cs.CL]

(or arXiv:2603.29541v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.29541

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Akbar Karimi [view email] [v1] Tue, 31 Mar 2026 10:24:20 UTC (960 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Can LLM Age…modellanguage mo…announcefeaturepredictionagentarXiv cs.CL

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 174 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!