MegaParse
by QuivrHQ · open-source · Last verified 2026-03-17
Open-source document parser from QuivrHQ for ingesting PDFs, Word documents, PowerPoint presentations, and web pages into RAG pipelines. It offers multiple parsing strategies and LLM-enhanced parsing.
https://github.com/QuivrHQ/MegaParse
Overall grade: D (Poor)
Adoption: D · Quality: B+ · Freshness: A · Citations: D · Engagement: F
Specifications
- License: Apache-2.0
- Pricing: open-source
- Capabilities: multi-format-parsing, llm-enhanced-parsing, table-extraction, markdown-output
- Integrations: langchain, llamaindex
- Use Cases: rag-ingestion, document-conversion, data-extraction
- API Available: Yes
- SDK Languages: python
- Deployment: self-hosted
- Rate Limits: N/A (open-source)
- Data Privacy: Self-hosted, user-managed
- Tags: document-parsing, open-source, multi-format, quivr
- Added: 2026-03-17
- Completeness: 100%
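The listing pairs markdown-output with the rag-ingestion use case. As a rough illustration of what that hand-off can look like, the sketch below splits Markdown text into one chunk per heading section, which is a common pre-indexing step. It is not MegaParse's own API: the function `chunk_markdown` and the sample text are hypothetical, only the Python standard library is used, and the input is assumed to be Markdown that some parser has already produced.

```python
import re
from typing import Dict, List


def chunk_markdown(markdown: str) -> List[Dict[str, str]]:
    """Split Markdown text into one chunk per heading section.

    Hypothetical helper for illustration; not part of MegaParse.
    """
    chunks: List[Dict[str, str]] = []
    title = ""  # text before the first heading gets an empty title
    body: List[str] = []
    for line in markdown.splitlines():
        match = re.match(r"^(#{1,6})\s+(.*)$", line)
        if match:
            # A new heading closes the previous section, if it had any text.
            text = "\n".join(body).strip()
            if text:
                chunks.append({"title": title, "text": text})
            title = match.group(2).strip()
            body = []
        else:
            body.append(line)
    # Flush the final section.
    text = "\n".join(body).strip()
    if text:
        chunks.append({"title": title, "text": text})
    return chunks


# Assumed sample standing in for parser output.
sample = "# Report\nIntro paragraph.\n\n## Results\n| a | b |\n|---|---|\n| 1 | 2 |\n"
chunks = chunk_markdown(sample)
for chunk in chunks:
    print(chunk["title"], "->", len(chunk["text"]), "chars")
```

Splitting on headings keeps each table with the section that introduces it. The sketch does not handle `#` lines inside fenced code blocks, which it would treat as headings.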
Index Score: 34.7
- Adoption: 32
- Quality: 72
- Freshness: 82
- Citations: 30
- Engagement: 0