Arena-Hard Auto
by [unverified] · unknown · Last verified 2026-03-26T21:48:00.934Z
Automated benchmark derived from Chatbot Arena for evaluating instruction-following and open-ended generation.
https://lmarena.ai ↗F
F—Critical
Adoption: FQuality: AFreshness: A+Citations: FEngagement: F
Specifications
- Pricing
- unknown
- Capabilities
- unverified
- Integrations
- Use Cases
- API Available
- No
- Tags
- evaluation, instruction, automated
- Added
- 2026-03-26T21:48:00.934Z
- Completeness
- 73%
Index Score
16Adoption
0
Quality
80
Freshness
100
Citations
0
Engagement
0