Dataset · financial · v2024

SEC-EDGAR Filings

by U.S. Securities and Exchange Commission · free · Last verified 2026-03-17

The SEC-EDGAR Filings dataset comprises more than 20 million full-text regulatory filings submitted to the U.S. Securities and Exchange Commission since 1993, including 10-K annual reports, 10-Q quarterly reports, 8-K current reports, and proxy statements from U.S. public companies. It is a foundational corpus for financial NLP research, sentiment analysis, and financial document AI.

https://www.sec.gov/edgar/
Grade: B+ (Good)
Adoption: A · Quality: A · Freshness: A+ · Citations: A · Engagement: F

Specifications

License: Public Domain (US Government Works)
Pricing: free
Capabilities: financial-document-analysis, sentiment-analysis, risk-factor-extraction, financial-forecasting
Integrations: EDGAR API, sec-edgar-downloader (Python), HuggingFace Datasets
Use Cases: financial-nlp-research, investment-analysis, regulatory-compliance, model-training
API Available: Yes
Tags: financial-nlp, 10-K, 10-Q, earnings, regulatory-filings, SEC
Added: 2026-03-17
Completeness: 100%
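
The EDGAR API integration listed above can be exercised with a short Python sketch. It assumes the public data.sec.gov submissions endpoint and its `filings.recent` layout of parallel arrays. The function names (`submissions_url`, `fetch_submissions`, `recent_filings`) are illustrative and not part of any library, and the User-Agent contact string is a placeholder to replace with your own.

```python
import json
import urllib.request

# Assumed endpoint patterns for the public EDGAR API and filing archive.
SUBMISSIONS_URL = "https://data.sec.gov/submissions/CIK{cik:010d}.json"
ARCHIVE_URL = "https://www.sec.gov/Archives/edgar/data/{cik}/{accession}/{document}"


def submissions_url(cik: int) -> str:
    """Build the submissions URL; the CIK is zero-padded to 10 digits."""
    return SUBMISSIONS_URL.format(cik=cik)


def fetch_submissions(cik: int, user_agent: str) -> dict:
    """Download one company's filing index (requires network access).

    The SEC expects automated clients to identify themselves, so the
    caller supplies a User-Agent such as "Your Name you@example.com".
    """
    request = urllib.request.Request(
        submissions_url(cik), headers={"User-Agent": user_agent}
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def recent_filings(submissions: dict, form_type: str, cik: int) -> list[dict]:
    """Select filings of one form type from an already-parsed index."""
    recent = submissions["filings"]["recent"]
    rows = zip(
        recent["form"],
        recent["accessionNumber"],
        recent["filingDate"],
        recent["primaryDocument"],
    )
    selected = []
    for form, accession, filed, document in rows:
        if form != form_type:
            continue
        selected.append(
            {
                "form": form,
                "filed": filed,
                # Archive paths use the accession number without dashes.
                "url": ARCHIVE_URL.format(
                    cik=cik,
                    accession=accession.replace("-", ""),
                    document=document,
                ),
            }
        )
    return selected
```

With network access, `recent_filings(fetch_submissions(cik, "Your Name you@example.com"), "10-K", cik)` would return one dictionary per 10-K in the company's recent-filings window. Keeping the parsing step separate from the download makes it easy to test against saved JSON and to throttle requests in line with the SEC's fair-access guidance.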

Index Score

72.5
Adoption: 86
Quality: 88
Freshness: 98
Citations: 82
Engagement: 0
