Home·Curated Catalog·Computer Vision
👁 Curated Catalog · Computer Vision

COCO — Common Objects in Context

Large-scale object detection, segmentation, and captioning dataset from Microsoft Research.

LQS 89 · gold ✓ Commercial OK 330K images 25 GB JSON · JPG Released 2014
Browse commercial Computer Vision → Visit original source ↗
Source: cocodataset.org · maintained by Microsoft COCO Consortium
330K
images
25 GB
Size on disk
89
LQS · gold
2014
First released

About this dataset

COCO is one of the most widely-used object detection benchmarks in computer vision. Released by Microsoft Research in 2014 and expanded since, it contains 330K images with 2.5M labeled object instances across 80 common object categories, plus 250K people with keypoints and 1.5M object segmentation masks. It's the de facto benchmark for object detection, instance segmentation, and image captioning models.

License
Formats
JSON · JPG

LabelSets Quality Score

LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →

89
out of 100
gold tier

High-quality dataset across most dimensions

Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.

Completeness 99
Published by maintainer: 99% completeness across annotated fields.
Uniqueness 93
Exact-hash deduplication documented by maintainer.
Validation 82
Crowdsourced labels with quality-control protocol (redundancy, golden tests).
Size adequacy 92
330,000 images — exceeds 100,000 adequacy target for Computer Vision.
Format compliance 95
Industry-standard format — drop-in compatible with mainstream tooling.
Label density 78
Average 7.7 labels per item (high density).
Class balance 75
Moderate class skew — realistic production distribution.

What it's used for

Common tasks and benchmarks where COCO — Common Objects in Context is the default or competitive choice.

Sample statistics

What's actually in the dataset — from the maintainer's published stats.

330K images, 2.5M object instances, 80 categories, ~7.7 objects per image, 250K people with 17 keypoints each, 1.5M segmentation masks.

License

COCO — Common Objects in Context is distributed under CC BY 4.0. This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.

Need commercial-licensed Computer Vision data?

LabelSets sellers offer paid computer vision datasets with what public datasets often can't give you:

Browse paid Computer Vision → Sell your dataset

Similar public datasets

Other entries in the Computer Vision catalog.

Frequently Asked Questions

COCO — Common Objects in Context is distributed under CC BY 4.0, which generally permits commercial use. Always verify the current license terms with the maintainer (Microsoft COCO Consortium) before using in a commercial product.
COCO — Common Objects in Context contains 330,000 images. 330K images, 2.5M object instances, 80 categories, ~7.7 objects per image, 250K people with 17 keypoints each, 1.5M segmentation masks.
COCO — Common Objects in Context is maintained by Microsoft COCO Consortium and is available at https://cocodataset.org/#download. LabelSets indexes and scores this dataset for discoverability but does not redistribute it.
LQS is a 7-dimension quality score (completeness, uniqueness, validation, size adequacy, format compliance, label density, class balance) computed from the dataset's published statistics. Composite scores map to tiers: platinum (≥90), gold (≥75), silver (≥60), bronze (<60). Read the full methodology.