Models model language model training announce open-source application

J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling

arXiv eess.ASby Wataru Nakata, Kentaro Seki, Hitomi Yanaka, Yuki Saito, Shinnosuke Takamichi, Hiroshi SaruwatariApril 3, 20261 min read1 views

Source Quiz

arXiv:2407.15828v2 Announce Type: replace-cross Abstract: Spoken dialogue is essential for human-AI interactions, providing expressive capabilities beyond text. Developing effective spoken dialogue systems (SDSs) requires large-scale, high-quality, and diverse spoken dialogue corpora. However, existing datasets are often limited in size, spontaneity, or linguistic coherence. To address these limitations, we introduce J-CHAT, a 76,000-hour open-source Japanese spoken dialogue corpus. Constructed using an automated, language-independent methodology, J-CHAT ensures acoustic cleanliness, diversity, and natural spontaneity. The corpus is built from YouTube and podcast data, with extensive filtering and denoising to enhance quality. Experimental results with generative spoken dialogue language m

View PDF HTML (experimental)

Abstract:Spoken dialogue is essential for human-AI interactions, providing expressive capabilities beyond text. Developing effective spoken dialogue systems (SDSs) requires large-scale, high-quality, and diverse spoken dialogue corpora. However, existing datasets are often limited in size, spontaneity, or linguistic coherence. To address these limitations, we introduce J-CHAT, a 76,000-hour open-source Japanese spoken dialogue corpus. Constructed using an automated, language-independent methodology, J-CHAT ensures acoustic cleanliness, diversity, and natural spontaneity. The corpus is built from YouTube and podcast data, with extensive filtering and denoising to enhance quality. Experimental results with generative spoken dialogue language models trained on J-CHAT demonstrate its effectiveness for SDS development. By providing a robust foundation for training advanced dialogue models, we anticipate that J-CHAT will drive progress in human-AI dialogue research and applications.

Comments: 8 pages, 3 figures

Subjects:

Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)

Cite as: arXiv:2407.15828 [cs.CL]

(or arXiv:2407.15828v2 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2407.15828

arXiv-issued DOI via DataCite

Submission history

From: Wataru Nakata [view email] [v1] Mon, 22 Jul 2024 17:46:50 UTC (4,158 KB) [v2] Thu, 2 Apr 2026 09:29:59 UTC (396 KB)

Original source

arXiv eess.AS

https://arxiv.org/abs/2407.15828

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modellanguage modeltraining

ModelsLive

Cheaper/faster/easier makes for step changes (and that's why even current-level LLMs are transformative)

We already knew there's nothing new under the sun. Thanks to advances in telescopes, orbital launch, satellites, and space vehicles we now know there's nothing new above the sun either, but there is rather a lot of energy! For many phenomena, I think it's a matter of convenience and utility where you model them as discrete or continuous, aka, qualitative vs quantitative. On one level, nukes are simply a bigger explosion, and we already had explosions. On another level, they're sufficiently bigger as to have reshaped global politics and rewritten the decision theory of modern war. Perhaps the key thing is remembering that sufficiently large quantitative changes can make for qualitative macro effects. For example, basic elements of modern life include transport, communication, energy, comput

LessWrong

6m39 minutes ago

ReleasesRecent

Anthropic announces free Claude update for Microsoft 365 users, details here - India Today

Anthropic announces free Claude update for Microsoft 365 users, details here India Today

Google News: Claude

1mabout 21 hours ago

Releases

Chinese company Z.ai announces open-source image generation AI 'GLM-Image,' a hybrid of autoregressive and diffusion models - GIGAZINE

Chinese company Z.ai announces open-source image generation AI 'GLM-Image,' a hybrid of autoregressive and diffusion models GIGAZINE

GNews AI diffusion

1m3 months ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 242 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling

Submission history

Daily AI Digest

More about

Cheaper/faster/easier makes for step changes (and that's why even current-level LLMs are transformative)

Anthropic announces free Claude update for Microsoft 365 users, details here - India Today

Chinese company Z.ai announces open-source image generation AI 'GLM-Image,' a hybrid of autoregressive and diffusion models - GIGAZINE

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Models

Cheaper/faster/easier makes for step changes (and that's why even current-level LLMs are transformative)

Fears Over U.S. AI Dominance Boost Business for France’s Mistral - WSJ

'Europe needs AI cloud infrastructure': Mistral raises $830m for data centre near Paris - MSN

AI Hacker "Pliny the Liberator" Tests GPT-4 Security - StartupHub.ai