Home·Curated Catalog·Computer Vision
👁 Curated Catalog · Computer Vision

ImageNet

14.2M hand-annotated images across 21K categories — the dataset that launched deep learning.

LQS 83 · gold ⚠ Research-only 14.2M images 155 GB JPG · XML Released 2009
Browse commercial Computer Vision → Visit original source ↗
Source: image-net.org · maintained by Stanford Vision Lab / Princeton University
14.2M
images
155 GB
Size on disk
83
LQS · gold
2009
First released

About this dataset

ImageNet is the seminal large-scale image classification dataset maintained by Stanford Vision Lab. It contains 14.2M images hand-annotated across 21,841 WordNet synset categories. The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) subset of 1.2M images / 1000 classes remains the canonical benchmark for image classification and drove the deep learning revolution via AlexNet in 2012.

Formats
JPG · XML

LabelSets Quality Score

LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →

83
out of 100
gold tier

Solid dataset with some trade-offs

Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.

Completeness 88
No public completeness metric; using prior for 'crowdsourced_qc' datasets.
Uniqueness 93
Exact-hash deduplication documented by maintainer.
Validation 82
Crowdsourced labels with quality-control protocol (redundancy, golden tests).
Size adequacy 96
14,200,000 images — exceeds 100,000 adequacy target for Computer Vision.
Format compliance 95
Industry-standard format — drop-in compatible with mainstream tooling.
Label density 52
Average 1.0 labels per item (sparse).
Class balance 58
Long-tail distribution — dominant classes overrepresented.

What it's used for

Common tasks and benchmarks where ImageNet is the default or competitive choice.

Sample statistics

What's actually in the dataset — from the maintainer's published stats.

14.2M images across 21,841 synsets. ILSVRC subset: 1.28M train, 50K val, 100K test across 1000 classes.

License

ImageNet is distributed under Custom (non-commercial research). This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.

Heads up: this dataset's license restricts commercial use. If you need computer vision data for production, check LabelSets' paid datasets below — every listing has an explicit commercial license.

Need commercial-licensed Computer Vision data?

LabelSets sellers offer paid computer vision datasets with what public datasets often can't give you:

Browse paid Computer Vision → Sell your dataset

Similar public datasets

Other entries in the Computer Vision catalog.

Frequently Asked Questions

ImageNet is distributed under Custom (non-commercial research), which restricts commercial use. For a commercially-licensed alternative in computer vision, see LabelSets' paid datasets.
ImageNet contains 14,200,000 images. 14.2M images across 21,841 synsets. ILSVRC subset: 1.28M train, 50K val, 100K test across 1000 classes.
ImageNet is maintained by Stanford Vision Lab / Princeton University and is available at https://www.image-net.org/download.php. LabelSets indexes and scores this dataset for discoverability but does not redistribute it.
LQS is a 7-dimension quality score (completeness, uniqueness, validation, size adequacy, format compliance, label density, class balance) computed from the dataset's published statistics. Composite scores map to tiers: platinum (≥90), gold (≥75), silver (≥60), bronze (<60). Read the full methodology.