🎙 Curated Catalog · Audio

Mozilla Common Voice

Crowdsourced multilingual voice dataset — 20K+ hours across 100+ languages.

LQS 79 · gold ✓ Commercial OK 14M validated clips 150 GB MP3 · TSV Released 2017

Browse commercial Audio → Visit original source ↗

Source: commonvoice.mozilla.org · maintained by Mozilla Foundation

About this dataset

Common Voice is Mozilla's crowdsourced multilingual voice dataset. Contributors read short sentences out loud; other contributors validate the clips. Latest release: 20K+ validated hours across 100+ languages, with demographic metadata (age, gender, accent). All clips are CC0, making Common Voice one of the few speech datasets safe for commercial use.

Maintainer

Mozilla Foundation

License

CC0 1.0

Formats

MP3 · TSV

Paper

Read on arxiv.org →

LabelSets Quality Score

LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →

out of 100

gold tier

Solid dataset with some trade-offs

Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.

Completeness 88

No public completeness metric; using prior for 'crowdsourced_qc' datasets.

Uniqueness 68

Minimal deduplication disclosed.

Validation 82

Crowdsourced labels with quality-control protocol (redundancy, golden tests).

Size adequacy 96

14,000,000 clips — exceeds 100,000 adequacy target for Audio.

Format compliance 95

Industry-standard format — drop-in compatible with mainstream tooling.

Label density 52

Average 1.0 labels per item (sparse).

Class balance 58

Long-tail distribution — dominant classes overrepresented.

What it's used for

Common tasks and benchmarks where Mozilla Common Voice is the default or competitive choice.

Multilingual ASR
Speaker verification
Accent research
Voice biometrics

Sample statistics

What's actually in the dataset — from the maintainer's published stats.

20K+ validated hours, 100+ languages, 14M+ validated clips. English alone: 3,200+ hours. Demographic metadata on voluntary basis.

License

Mozilla Common Voice is distributed under CC0 1.0. This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.

Need commercial-licensed Audio data?

LabelSets sellers offer paid audio datasets with what public datasets often can't give you:

Explicit commercial license in writing
LQS-verified quality in your specific use-case
Instant download — no DUA, credentialed access, or research gating
PII scanned, deduplicated, and production-ready

Browse paid Audio → Sell your dataset

Frequently Asked Questions

Mozilla Common Voice is distributed under CC0 1.0, which generally permits commercial use. Always verify the current license terms with the maintainer (Mozilla Foundation) before using in a commercial product.

Mozilla Common Voice contains 14,000,000 validated clips. 20K+ validated hours, 100+ languages, 14M+ validated clips. English alone: 3,200+ hours. Demographic metadata on voluntary basis.

Mozilla Common Voice is maintained by Mozilla Foundation and is available at https://commonvoice.mozilla.org/en/datasets. LabelSets indexes and scores this dataset for discoverability but does not redistribute it.

LQS is a 7-dimension quality score (completeness, uniqueness, validation, size adequacy, format compliance, label density, class balance) computed from the dataset's published statistics. Composite scores map to tiers: platinum (≥90), gold (≥75), silver (≥60), bronze (<60). Read the full methodology.

Mozilla Common Voice

About this dataset

LabelSets Quality Score

Solid dataset with some trade-offs

What it's used for

Sample statistics

License

Need commercial-licensed Audio data?

Similar public datasets

Frequently Asked Questions