CIFAR-100

60,000 tiny 32×32 images across 100 balanced classes — a standard classification benchmark.

LQS 88 (gold) · ✓ Commercial OK · 60K images · 170 MB · Formats: Binary, Pickle · Released 2009

Source: cs.toronto.edu · maintained by Alex Krizhevsky / University of Toronto

About this dataset

CIFAR-100 is a small-image classification benchmark from the University of Toronto. It contains 60,000 32×32 color images across 100 fine-grained classes, grouped into 20 superclasses, with exactly 600 images per class (500 train + 100 test). It is commonly used for teaching, fast iteration, and regularization research.
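The class structure described above can be checked directly against the Python (pickle) version of the dataset. The sketch below is a minimal example, not an official loader: it assumes the archive has been unpacked into a local `cifar-100-python` directory containing `train` and `test` files, and that each file is a pickled dictionary with `fine_labels` and `coarse_labels` entries whose keys come back as bytes when loaded with `encoding="bytes"`. The function names are hypothetical.

```python
import pickle
from collections import Counter


def load_split(path):
    """Load one split file and return (fine_labels, coarse_labels).

    Assumes the pickle layout of the Python version of CIFAR-100:
    a dict with b"fine_labels" and b"coarse_labels" lists.
    """
    with open(path, "rb") as f:
        # encoding="bytes" keeps the dictionary keys as bytes objects.
        batch = pickle.load(f, encoding="bytes")
    return batch[b"fine_labels"], batch[b"coarse_labels"]


def summarize(fine_labels, coarse_labels):
    """Return (image count, fine class count, superclass count, per-class counts)."""
    per_class = Counter(fine_labels)
    return len(fine_labels), len(per_class), len(set(coarse_labels)), per_class
```

If the assumptions hold, calling `summarize(*load_split("cifar-100-python/train"))` should report 50,000 images, 100 fine classes, 20 superclasses, and 500 images for every fine class; the `test` file should report 10,000 images with 100 per class.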

Maintainer

Alex Krizhevsky / University of Toronto

License

MIT-style (unrestricted research use)

Formats

Binary · Pickle
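For the binary format, a minimal parsing sketch follows. It assumes the record layout documented on the maintainer's page: one coarse-label byte, one fine-label byte, then 3,072 pixel bytes stored as three 32×32 channel planes (red, green, blue) in row-major order. The function names are hypothetical, and the code takes raw bytes so it makes no assumption about file names.

```python
IMAGE_SIDE = 32
PLANE_BYTES = IMAGE_SIDE * IMAGE_SIDE  # 1,024 bytes per color channel
RECORD_BYTES = 2 + 3 * PLANE_BYTES     # 2 label bytes + 3,072 pixel bytes


def iter_records(raw):
    """Yield (coarse_label, fine_label, pixels) for each record in raw bytes."""
    if len(raw) % RECORD_BYTES != 0:
        raise ValueError("input length is not a multiple of the record size")
    for offset in range(0, len(raw), RECORD_BYTES):
        record = raw[offset:offset + RECORD_BYTES]
        # Byte 0 is the superclass label, byte 1 the fine class label.
        yield record[0], record[1], record[2:]


def pixel_at(pixels, row, col):
    """Return the (r, g, b) value at (row, col) from one record's pixel bytes."""
    index = row * IMAGE_SIDE + col
    return (
        pixels[index],                    # red plane
        pixels[PLANE_BYTES + index],      # green plane
        pixels[2 * PLANE_BYTES + index],  # blue plane
    )
```

To use it, read a whole split file with `open(path, "rb").read()` and pass the result to `iter_records`. Under this layout a 50,000-image training file would be 50,000 × 3,074 bytes long.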

Paper

Read on cs.toronto.edu →

LabelSets Quality Score

LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →

88 out of 100

gold tier

High-quality dataset across most dimensions

Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.

Completeness 95

No public completeness metric; using prior for 'expert_curated' datasets.

Uniqueness 95

Manually vetted for uniqueness by maintainer.

Validation 92

Labels produced by domain experts or trained annotators.

Size adequacy 78

60,000 images — below 100,000 target for Computer Vision, but usable.

Format compliance 95

Industry-standard format — drop-in compatible with mainstream tooling.

Label density 52

Average 1.0 labels per item (sparse).

Class balance 100

Normalized class entropy 1.00 — well-balanced across classes.
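As an illustration of the class balance figure, normalized class entropy is commonly defined as the Shannon entropy of the class distribution divided by its maximum, log K for K classes. The sketch below uses that standard definition; whether LQS computes it in exactly this way is an assumption, and the function name is hypothetical.

```python
import math


def normalized_class_entropy(counts):
    """Shannon entropy of the class counts divided by log(K), in [0, 1]."""
    k = len(counts)
    total = sum(counts)
    if k < 2 or total <= 0:
        raise ValueError("need at least two classes and a positive total")
    # Classes with zero items contribute nothing to the entropy sum.
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return entropy / math.log(k)
```

With 100 classes of 600 images each, every class has probability 0.01, so the entropy equals log 100 and the normalized value is 1.00 up to floating-point rounding. Any imbalance pushes the value below 1.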

What it's used for

Common tasks and benchmarks where CIFAR-100 is the default or competitive choice.

Image classification
Benchmarking
Regularization research

Sample statistics

What's actually in the dataset — from the maintainer's published stats.

100 balanced classes × 600 images = 60,000 images in total, split into 50,000 training and 10,000 test images. Class balance is perfect.

License

CIFAR-100 is distributed under an MIT-style license (unrestricted research use). This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.

Need commercial-licensed Computer Vision data?

LabelSets sellers offer paid computer vision datasets with what public datasets often can't give you:

Explicit commercial license in writing
LQS-verified quality in your specific use-case
Instant download — no DUA, credentialed access, or research gating
PII scanned, deduplicated, and production-ready

Frequently Asked Questions

CIFAR-100 is distributed under an MIT-style license (unrestricted research use), which generally permits commercial use. Always verify the current license terms with the maintainer (Alex Krizhevsky / University of Toronto) before using it in a commercial product.

CIFAR-100 contains 60,000 images: 100 balanced classes of 600 images each, split into 50,000 training and 10,000 test images.

CIFAR-100 is maintained by Alex Krizhevsky / University of Toronto and is available at https://www.cs.toronto.edu/~kriz/cifar.html. LabelSets indexes and scores this dataset for discoverability but does not redistribute it.

LQS is a 7-dimension quality score (completeness, uniqueness, validation, size adequacy, format compliance, label density, class balance) computed from the dataset's published statistics. Composite scores map to tiers: platinum (≥90), gold (≥75), silver (≥60), bronze (<60). Read the full methodology.
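The tier thresholds stated above can be written as a small lookup. This is a sketch of the published cut-offs only; the function name is hypothetical, and it does not reproduce how the composite score itself is weighted, which is not stated here.

```python
def lqs_tier(score):
    """Map a composite LQS score (0-100) to its tier name."""
    if score >= 90:
        return "platinum"
    if score >= 75:
        return "gold"
    if score >= 60:
        return "silver"
    return "bronze"
```

For example, `lqs_tier(88)` returns `"gold"`, matching the tier shown for CIFAR-100.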
