Home·Curated Catalog·Computer Vision
👁 Curated Catalog · Computer Vision

CIFAR-100

60,000 tiny 32×32 images across 100 balanced classes — a standard classification benchmark.

LQS 88 · gold ✓ Commercial OK 60K images 170 MB Binary · Pickle Released 2009
Browse commercial Computer Vision → Visit original source ↗
Source: cs.toronto.edu · maintained by Alex Krizhevsky / University of Toronto
60K
images
170 MB
Size on disk
88
LQS · gold
2009
First released

About this dataset

CIFAR-100 is a small-image classification benchmark from the University of Toronto. 60,000 32×32 color images across 100 fine-grained classes, grouped into 20 superclasses, with exactly 600 images per class (500 train + 100 test). Commonly used for teaching, fast iteration, and regularization research.

Formats
Binary · Pickle

LabelSets Quality Score

LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →

88
out of 100
gold tier

High-quality dataset across most dimensions

Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.

Completeness 95
No public completeness metric; using prior for 'expert_curated' datasets.
Uniqueness 95
Manually vetted for uniqueness by maintainer.
Validation 92
Labels produced by domain experts or trained annotators.
Size adequacy 78
60,000 images — below 100,000 target for Computer Vision, but usable.
Format compliance 95
Industry-standard format — drop-in compatible with mainstream tooling.
Label density 52
Average 1.0 labels per item (sparse).
Class balance 100
Normalized class entropy 1.00 — well-balanced across classes.

What it's used for

Common tasks and benchmarks where CIFAR-100 is the default or competitive choice.

Sample statistics

What's actually in the dataset — from the maintainer's published stats.

100 balanced classes × 600 images = 60,000 total. Perfect class balance. 50K train + 10K test split.

License

CIFAR-100 is distributed under MIT-style (unrestricted research use). This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.

Need commercial-licensed Computer Vision data?

LabelSets sellers offer paid computer vision datasets with what public datasets often can't give you:

Browse paid Computer Vision → Sell your dataset

Similar public datasets

Other entries in the Computer Vision catalog.

Frequently Asked Questions

CIFAR-100 is distributed under MIT-style (unrestricted research use), which generally permits commercial use. Always verify the current license terms with the maintainer (Alex Krizhevsky / University of Toronto) before using in a commercial product.
CIFAR-100 contains 60,000 images. 100 balanced classes × 600 images = 60,000 total. Perfect class balance. 50K train + 10K test split.
CIFAR-100 is maintained by Alex Krizhevsky / University of Toronto and is available at https://www.cs.toronto.edu/~kriz/cifar.html. LabelSets indexes and scores this dataset for discoverability but does not redistribute it.
LQS is a 7-dimension quality score (completeness, uniqueness, validation, size adequacy, format compliance, label density, class balance) computed from the dataset's published statistics. Composite scores map to tiers: platinum (≥90), gold (≥75), silver (≥60), bronze (<60). Read the full methodology.