Dataset · legal · v2023

CaseText Corpus

by Casetext (acquired by Thomson Reuters) · unknown · Last verified 2026-03-17

The CaseText Corpus is a large-scale dataset of US federal and state court decisions spanning decades of US jurisprudence. It includes full text, structured metadata, and citation networks, and is designed for legal research and for developing AI applications such as legal language models and case retrieval systems.

https://casetext.com

C (Below Average)

Adoption: B+ · Quality: A · Freshness: A · Citations: F · Engagement: F

Specifications

License: Custom (API access)
Pricing: unknown
Capabilities: legal-language-modeling, case-law-retrieval, citation-network-analysis, legal-named-entity-recognition, automated-case-summarization, legal-topic-modeling, precedent-analysis, citation-recommendation
API Available: Yes
Tags: case-law, legal-research, case-retrieval, us-law, nlp, corpus, legal-tech, citation-network, court-decisions, large-language-model, computational-law
Added: 2026-03-17
Completeness: 0.65%
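
The listing names citation networks and citation-network-analysis but does not document a record schema or API. The following is a minimal sketch of what such an analysis could look like, assuming each decision is available as a record with an `id` and a list of cited case ids. The field names, the sample records, and both helper functions are hypothetical and are not taken from the CaseText Corpus itself.

```python
# Hypothetical records: the field names "id" and "cites" are assumptions
# for illustration, not the actual CaseText Corpus schema.
cases = [
    {"id": "A", "year": 1990, "cites": []},
    {"id": "B", "year": 2001, "cites": ["A"]},
    {"id": "C", "year": 2010, "cites": ["A", "B"]},
]


def build_citation_graph(records):
    """Return (cites, cited_by) adjacency maps keyed by case id."""
    cites = {}
    cited_by = {}
    for rec in records:
        # Register every case as a node, even if it cites nothing.
        outgoing = cites.setdefault(rec["id"], set())
        for target in rec["cites"]:
            outgoing.add(target)
            cited_by.setdefault(target, set()).add(rec["id"])
    return cites, cited_by


def most_cited(records, top_n=3):
    """Rank cases by how many other cases in the sample cite them."""
    _, cited_by = build_citation_graph(records)
    counts = {rec["id"]: len(cited_by.get(rec["id"], ())) for rec in records}
    # Sort by descending citation count, then by id for a stable order.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


print(most_cited(cases))
```

Inbound citation count is only the simplest precedent-importance signal; the same adjacency maps could feed other graph measures if the real data exposes citation links in a comparable form.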
