Dataset · legal · v2023

CaseText Corpus

by Casetext (acquired by Thomson Reuters) · unknown · Last verified 2026-03-17

The CaseText Corpus is a large-scale dataset of US federal and state court decisions spanning decades of US jurisprudence. It includes full text, structured metadata, and citation networks, and is designed for legal research and for developing AI applications such as legal language models and case retrieval systems.

https://casetext.com
Grade: C (Below Average)
Adoption: B+ · Quality: A · Freshness: A · Citations: F · Engagement: F

Specifications

License
Custom (API access)
Pricing
unknown
Capabilities
legal-language-modeling, case-law-retrieval, citation-network-analysis, legal-named-entity-recognition, automated-case-summarization, legal-topic-modeling, precedent-analysis, citation-recommendation
API Available
Yes
Tags
case-law, legal-research, case-retrieval, us-law, nlp, corpus, legal-tech, citation-network, court-decisions, large-language-model, computational-law
Added
2026-03-17
Completeness
0.65%

Index Score

47
Adoption
74
Quality
88
Freshness
85
Citations
0
Engagement
0
