🎙 Curated Catalog · Audio

LibriSpeech

1,000 hours of read English audiobook speech — the standard ASR benchmark.

LQS 89 · gold ✓ Commercial OK 292K speech utterances 60 GB FLAC · TXT Released 2015

Browse commercial Audio → Visit original source ↗

Source: openslr.org · maintained by Vassil Panayotov (JHU) et al.

About this dataset

LibriSpeech is a 1,000-hour corpus of English read speech derived from public-domain LibriVox audiobooks, sampled at 16kHz. Widely used as the standard ASR training and evaluation set. Split into train-clean-100, train-clean-360, train-other-500, dev-clean, dev-other, test-clean, test-other for benchmarking on clean vs. noisy conditions.

Maintainer

Vassil Panayotov (JHU) et al.

License

CC BY 4.0

Formats

FLAC · TXT

Paper

Read on danielpovey.com →

LabelSets Quality Score

LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →

out of 100

gold tier

High-quality dataset across most dimensions

Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.

Completeness 95

No public completeness metric; using prior for 'expert_curated' datasets.

Uniqueness 90

Benchmark-grade splits with leakage prevention.

Validation 95

Multiple expert annotators with reconciliation pass.

Size adequacy 91

292,367 segments — exceeds 100,000 adequacy target for Audio.

Format compliance 95

Industry-standard format — drop-in compatible with mainstream tooling.

Label density 52

Average 1.0 labels per item (sparse).

Class balance 90

Near-uniform class distribution.

What it's used for

Common tasks and benchmarks where LibriSpeech is the default or competitive choice.

Automatic speech recognition
Speech-to-text
Speaker identification

Sample statistics

What's actually in the dataset — from the maintainer's published stats.

1,000 hours / 292K utterances across 2,484 speakers. 16kHz. 7 splits for clean/other × train/dev/test.

License

LibriSpeech is distributed under CC BY 4.0. This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.

Need commercial-licensed Audio data?

LabelSets sellers offer paid audio datasets with what public datasets often can't give you:

Explicit commercial license in writing
LQS-verified quality in your specific use-case
Instant download — no DUA, credentialed access, or research gating
PII scanned, deduplicated, and production-ready

Browse paid Audio → Sell your dataset

Frequently Asked Questions

LibriSpeech is distributed under CC BY 4.0, which generally permits commercial use. Always verify the current license terms with the maintainer (Vassil Panayotov (JHU) et al.) before using in a commercial product.

LibriSpeech contains 292,367 speech utterances. 1,000 hours / 292K utterances across 2,484 speakers. 16kHz. 7 splits for clean/other × train/dev/test.

LibriSpeech is maintained by Vassil Panayotov (JHU) et al. and is available at https://www.openslr.org/12/. LabelSets indexes and scores this dataset for discoverability but does not redistribute it.

LQS is a 7-dimension quality score (completeness, uniqueness, validation, size adequacy, format compliance, label density, class balance) computed from the dataset's published statistics. Composite scores map to tiers: platinum (≥90), gold (≥75), silver (≥60), bronze (<60). Read the full methodology.

LibriSpeech

About this dataset

LabelSets Quality Score

High-quality dataset across most dimensions

What it's used for

Sample statistics

License

Need commercial-licensed Audio data?

Similar public datasets

Frequently Asked Questions