An independently curated index of public datasets used across ML research and production. Every entry has verified attribution, license information, and a LabelSets Quality Score (LQS). We don't host these; we link to the original maintainer.
LabelSets' paid marketplace offers LQS-verified datasets with explicit commercial licenses, instant download, and no research-only restrictions, from $29 to flagship tier.
These are third-party public datasets maintained by universities, research labs, companies, and consortia. LabelSets indexes them for discoverability and applies our 7-dimension LabelSets Quality Score (LQS) so you can compare quality across sources.
We do not redistribute these datasets. Each entry links to the original maintainer; download and licensing happen on their platform. Our job is to make these easier to find, compare, and evaluate.
If you maintain a public dataset and want it added (or corrected), email us. If you need commercial-licensed data for production ML, browse our paid marketplace.