lm-evaluation-harness
by EleutherAI · open-source · Last verified 2026-04-24
EleutherAI's standardized framework for evaluating LLMs on 200+ benchmarks.
https://github.com/EleutherAI/lm-evaluation-harness ↗D
D—Poor
Adoption: C+Quality: B+Freshness: ACitations: FEngagement: F
Specifications
- License
- Open Source
- Pricing
- open-source
- Capabilities
- Integrations
- Use Cases
- API Available
- No
- SDK Languages
- Tags
- evaluation, benchmarks, eleutherai, python
- Added
- 2026-04-24
- Completeness
- 80%
Index Score
35Adoption
50
Quality
70
Freshness
80
Citations
4
Engagement
0