Tool · testing-evaluation · v1.0

lm-evaluation-harness

by EleutherAI · open-source · Last verified 2026-04-24

EleutherAI's standardized framework for evaluating LLMs on 200+ benchmarks.

https://github.com/EleutherAI/lm-evaluation-harness
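
As a rough illustration of what running a benchmark with the harness looks like, the sketch below assembles a command line for its `lm_eval` entry point. The flags shown (`--model`, `--model_args`, `--tasks`, `--device`, `--batch_size`) are the ones I understand the project's README to document for recent releases, so confirm them with `lm_eval --help` for your installed version. The helper name `build_lm_eval_command`, the model `EleutherAI/pythia-160m`, and the task names are illustrative assumptions, not part of this listing.

```python
import shlex


def build_lm_eval_command(pretrained, tasks, device="cpu", batch_size=8):
    """Assemble an argv list for the `lm_eval` CLI (hypothetical helper).

    Nothing is executed here; the list can be handed to subprocess.run()
    once the harness is installed.
    """
    if not tasks:
        raise ValueError("at least one task name is required")
    return [
        "lm_eval",
        "--model", "hf",                            # Hugging Face backend
        "--model_args", f"pretrained={pretrained}",
        "--tasks", ",".join(tasks),                 # comma-separated task names
        "--device", device,
        "--batch_size", str(batch_size),
    ]


if __name__ == "__main__":
    # Model and task names below are placeholders chosen for illustration.
    argv = build_lm_eval_command("EleutherAI/pythia-160m", ["hellaswag", "arc_easy"])
    print(shlex.join(argv))
```

Building the argument list separately from running it keeps the example independent of any particular install and makes the chosen tasks easy to inspect before a long evaluation run.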

D (Poor)

Adoption: C+ · Quality: B+ · Freshness: A · Citations: F · Engagement: F

Specifications

License: Open Source
Pricing: open-source
Capabilities
Integrations
Use Cases
API Available: No
SDK Languages
Tags: evaluation, benchmarks, eleutherai, python
Added: 2026-04-24
Completeness: 80%

Index Score: 35

Adoption: 50
Quality: 70
Freshness: 80
Citations: 4
Engagement: 0
