CaseText Corpus
by Casetext (acquired by Thomson Reuters) · unknown · Last verified 2026-03-17
The CaseText Corpus is a large-scale dataset of US federal and state court decisions. It includes full text, structured metadata, and citation networks, designed for legal research and the development of AI applications like legal language models and case retrieval systems, spanning decades of US jurisprudence.
https://casetext.com ↗B
B—Above Average
Adoption: B+Quality: AFreshness: ACitations: B+Engagement: F
Specifications
- License
- Custom (API access)
- Pricing
- unknown
- Capabilities
- legal-language-modeling, case-law-retrieval, citation-network-analysis, legal-named-entity-recognition, automated-case-summarization, legal-topic-modeling, precedent-analysis, citation-recommendation
- Integrations
- [object Object], [object Object], [object Object], [object Object], [object Object]
- Use Cases
- [object Object], [object Object], [object Object], [object Object], [object Object]
- API Available
- Yes
- Tags
- case-law, legal-research, case-retrieval, us-law, nlp, corpus, legal-tech, citation-network, court-decisions, large-language-model, computational-law
- Added
- 2026-03-17
- Completeness
- 0.65%
Index Score
64.7Adoption
74
Quality
88
Freshness
85
Citations
70
Engagement
0