6.7M US court decisions spanning 360 years — fully digitized by Harvard Law.
Browse commercial Legal → Visit original source ↗The Caselaw Access Project (CAP) is Harvard Law School's digitization of every US federal and state court opinion in the Harvard Law Library — 6.7M individual cases spanning 360 years. Since March 2024, CAP is fully open-access with no case volume limits. Includes OCR'd text, citations, parties, and court metadata.
LQS is our 7-dimension quality score, computed from the dataset's published statistics. See methodology →
Composite score computed from the 7 dimensions below: completeness, uniqueness, validation health, size adequacy, format compliance, label density, and class balance.
Common tasks and benchmarks where Caselaw Access Project is the default or competitive choice.
What's actually in the dataset — from the maintainer's published stats.
Caselaw Access Project is distributed under CC0 (post-2024 release). This is a third-party public dataset; LabelSets indexes and scores it but does not host or redistribute the data. Always verify current license terms with the maintainer before commercial use.
LabelSets sellers offer paid legal datasets with what public datasets often can't give you:
Other entries in the Legal catalog.