Skip to main content
Tooltesting-evaluationv1.0

lm-evaluation-harness

by EleutherAI · open-source · Last verified 2026-04-24

EleutherAI's standardized framework for evaluating LLMs on 200+ benchmarks.

https://github.com/EleutherAI/lm-evaluation-harness
D
DPoor
Adoption: C+Quality: B+Freshness: ACitations: FEngagement: F

Specifications

License
Open Source
Pricing
open-source
Capabilities
Integrations
Use Cases
API Available
No
SDK Languages
Tags
evaluation, benchmarks, eleutherai, python
Added
2026-04-24
Completeness
80%

Index Score

35
Adoption
50
Quality
70
Freshness
80
Citations
4
Engagement
0

Need this tool deployed for your team?

Get a Custom Setup

Explore the full AI ecosystem on Agents as a Service