Crowdsourced multilingual voice dataset — 20K+ hours across 100+ languages.
Common Voice is Mozilla's crowdsourced multilingual voice dataset. Contributors read short sentences out loud; other contributors validate the clips. Latest release: 20K+ validated hours across 100+ languages, with demographic metadata (age, gender, accent). All clips are CC0, making Common Voice one of the few speech datasets safe for commercial use.
LQS is our 7-dimension quality score, computed from the dataset's published statistics.
Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.
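As a rough illustration of how seven dimension scores could roll up into one composite, here is a minimal sketch. The dimension names come from the text above; the 0-100 scale, the equal weighting, and the function name `composite_score` are assumptions made for illustration, since the actual LQS formula and weights are not stated here.

```python
from statistics import mean

# Dimension names are taken from the text. The 0-100 scale and the
# equal weighting below are ASSUMPTIONS, not the published LQS method.
DIMENSIONS = (
    "completeness",
    "uniqueness",
    "validation_health",
    "size_adequacy",
    "format_compliance",
    "label_density",
    "class_balance",
)


def composite_score(scores: dict[str, float]) -> float:
    """Return the unweighted mean of the seven dimension scores.

    Hypothetical helper: raises ValueError if a dimension is missing
    or a score falls outside the assumed 0-100 range.
    """
    missing = [name for name in DIMENSIONS if name not in scores]
    if missing:
        raise ValueError(f"missing dimensions: {missing}")
    for name in DIMENSIONS:
        if not 0 <= scores[name] <= 100:
            raise ValueError(f"{name} must be between 0 and 100")
    return round(mean(scores[name] for name in DIMENSIONS), 1)
```

A real scoring scheme would likely weight the dimensions differently, for example giving validation health more influence for a crowdsourced dataset; the equal weights here only keep the sketch simple.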
Common tasks and benchmarks where Mozilla Common Voice is the default or competitive choice.
What's actually in the dataset — from the maintainer's published stats.
Mozilla Common Voice is distributed under CC0 1.0. This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.
LabelSets sellers offer paid audio datasets with what public datasets often can't give you: