Unstructured
by Unstructured · open-source · Last verified 2026-04-24
ETL library parsing PDFs, Office docs, and HTML into structured elements.
https://unstructured.io
Overall grade: C (Below Average)
Adoption: C+ · Quality: B+ · Freshness: A · Citations: C · Engagement: F
Specifications
- License: Open Source
- Pricing: open-source
- Capabilities:
- Integrations:
- Use Cases:
- API Available: No
- SDK Languages: Python
- Deployment: self-hosted, cloud-api, docker
- Rate Limits: free tier of 1K pages/month; paid plans scale
- Data Privacy: SOC 2 Type II; self-hosted option for data control
- Tags: etl, parsing, documents, python
- Added: 2026-04-24
- Completeness: 60%
Index Score: 44
- Adoption: 50
- Quality: 70
- Freshness: 80
- Citations: 40
- Engagement: 0