Skip to main content
brand
context
industry
strategy
AaaS
Datasetbenchmarksv1.0

ARC Dataset

by Allen Institute for AI · open-source · Last verified 2026-03-17

The AI2 Reasoning Challenge (ARC) dataset contains 7,787 grade 3–9 science exam questions split into Easy and Challenge partitions. The Challenge set contains questions that require deeper reasoning and world knowledge, making it a reliable signal for advanced language understanding.

https://huggingface.co/datasets/allenai/ai2_arc
B+
B+Good
Adoption: A+Quality: AFreshness: B+Citations: A+Engagement: F

Specifications

License
CC-BY-SA-4.0
Pricing
open-source
Capabilities
science-evaluation, reasoning-benchmark, multiple-choice-qa
Integrations
huggingface-datasets, lm-eval-harness
Use Cases
model-evaluation, science-reasoning, benchmarking
API Available
No
Tags
benchmark, science-questions, multiple-choice, reasoning, ai2
Added
2026-03-17
Completeness
100%

Index Score

76.2
Adoption
90
Quality
87
Freshness
71
Citations
91
Engagement
0

Need this tool deployed for your team?

Get a Custom Setup

Explore the full AI ecosystem on Agents as a Service