Research Papers research paper arxiv ai artificial-intelligence

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation

arXivMarch 31, 202610 min read0 views

arXiv:2510.06961v4 Announce Type: replace-cross Abstract: We present the Open ASR Leaderboard, a reproducible benchmarking platform with community contributions from academia and industry. It compares 86 open-source and proprietary systems across 12 datasets, with English short- and long-form and multilingual short-form tracks. We standardize word error rate (WER) and inverse real-time factor (RTFx) evaluation for consistent accuracy-efficiency comparisons across model architectures and toolkits (e.g., ESPNet, NeMo, SpeechBrain, Transformers). We observe that Conformer-based encoders paired wi — Vaibhav Srivastav, Steven Zheng, Eric Bezzam, Eustache Le Bihan, Nithin Rao Koluguri, Piotr \.Zelasko, Somshubra Majumdar, Adel Moumen, Sanchit Gandhi

View PDF HTML (experimental)

Abstract:We present the Open ASR Leaderboard, a reproducible benchmarking platform with community contributions from academia and industry. It compares 86 open-source and proprietary systems across 12 datasets, with English short- and long-form and multilingual short-form tracks. We standardize word error rate (WER) and inverse real-time factor (RTFx) evaluation for consistent accuracy-efficiency comparisons across model architectures and toolkits (e.g., ESPNet, NeMo, SpeechBrain, Transformers). We observe that Conformer-based encoders paired with transformer-based decoders achieve the best average WER, while connectionist temporal classification (CTC) and token-and-duration transducer (TDT) decoders offer superior RTFx, making them better suited for long-form and batched processing. All code and dataset loaders are open-sourced to support transparent, extensible evaluation. We present our evaluation methodology to facilitate community-driven benchmarking in ASR and other tasks.

Comments: Leaderboard: this https URL ; Code: this https URL

Subjects:

Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Cite as: arXiv:2510.06961 [cs.CL]

(or arXiv:2510.06961v4 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2510.06961

arXiv-issued DOI via DataCite

Submission history

From: Eric Bezzam [view email] [v1] Wed, 8 Oct 2025 12:44:51 UTC (25 KB) [v2] Thu, 9 Oct 2025 07:39:28 UTC (25 KB) [v3] Wed, 10 Dec 2025 17:30:55 UTC (23 KB) [v4] Mon, 30 Mar 2026 09:52:05 UTC (2,783 KB)

Original source

arXiv

https://arxiv.org/abs/2510.06961

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Research Papers

Vector researchers presented more than 50 papers at ICML 2024

Vector researchers presented more than 50 papers at the 2024 International Conference on Machine Learning (ICML). 35 papers co-authored by Vector Faculty Members were accepted to the conference, with a [ ] The post Vector researchers presented more than 50 papers at ICML 2024 appeared first on Vector Institute for Artificial Intelligence .

Vector Institute

1mover 1 year ago

Research Papers

Vector Researchers present papers at ACL 2024

Vector researchers will be well represented at the 62nd Annual Meeting of the Association for Computational Linguistics in Bangkok, Thailand this year. 14 papers co-authored by Vector-affiliated researchers are being [ ] The post Vector Researchers present papers at ACL 2024 appeared first on Vector Institute for Artificial Intelligence .

Vector Institute

1mover 1 year ago

Models

Vector researcher Wenhu Chen on improving and benchmarking foundation models

By Wenhu Chen The past year has seen great progress in foundation models as they achieve expert-level performance in solving challenging, real-world problems. In early 2023, the best open-source 7B [ ] The post Vector researcher Wenhu Chen on improving and benchmarking foundation models appeared first on Vector Institute for Artificial Intelligence .

Vector Institute

1mover 1 year ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 139 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation

Submission history

Daily AI Digest

More about

Vector researchers presented more than 50 papers at ICML 2024

Vector Researchers present papers at ACL 2024

Vector researcher Wenhu Chen on improving and benchmarking foundation models

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Research Papers

Vector researchers presented more than 50 papers at ICML 2024

Vector Researchers present papers at ACL 2024

Exclusive | OpenAI’s Former Research Chief Aims to Automate Manufacturing With AI - WSJ

Huihui-Qwen3.5-9B-Abliterated: What This Uncensored Model Does