Dataset · benchmarks · v1.0

ARC Dataset

by Allen Institute for AI · open-source · Last verified 2026-03-17

The AI2 Reasoning Challenge (ARC) dataset contains 7,787 multiple-choice science exam questions for grades 3–9, split into an Easy and a Challenge partition. The Challenge set is limited to questions that both a retrieval-based baseline and a word co-occurrence baseline answered incorrectly, so it tests reasoning and world knowledge beyond surface text matching.

https://huggingface.co/datasets/allenai/ai2_arc
Overall grade: B+ (Good)
Adoption: A+ · Quality: A · Freshness: B+ · Citations: A+ · Engagement: F

Specifications

License: CC-BY-SA-4.0
Pricing: open-source
Capabilities: science-evaluation, reasoning-benchmark, multiple-choice-qa
Integrations: huggingface-datasets, lm-eval-harness
Use Cases: model-evaluation, science-reasoning, benchmarking
API Available: No
Tags: benchmark, science-questions, multiple-choice, reasoning, ai2
Added: 2026-03-17
Completeness: 100%
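
Since the listing names huggingface-datasets as an integration and multiple-choice-qa as a capability, a minimal sketch of how an ARC-style record might be turned into a prompt and scored is given below. The field names (`question`, `choices.text`, `choices.label`, `answerKey`) and the `ARC-Challenge` config name follow the dataset card at the URL above. The helper names, the prompt layout, the two sample records, and the `always_first` baseline are illustrative assumptions, not part of the dataset or of any official evaluation protocol.

```python
from typing import Callable, Dict, List

# One ARC-style record. Field names follow the Hugging Face dataset card;
# the alias itself is only a convenience for this sketch.
ArcRecord = Dict[str, object]


def format_prompt(record: ArcRecord) -> str:
    """Render a record as a plain-text multiple-choice prompt (assumed layout)."""
    lines = [f"Question: {record['question']}"]
    choices = record["choices"]
    for label, text in zip(choices["label"], choices["text"]):
        lines.append(f"{label}. {text}")
    lines.append("Answer:")
    return "\n".join(lines)


def accuracy(records, predict: Callable[[ArcRecord], str]) -> float:
    """Fraction of records where the predicted label equals answerKey."""
    if len(records) == 0:
        return 0.0
    correct = sum(1 for r in records if predict(r).strip() == r["answerKey"])
    return correct / len(records)


def load_arc_challenge_test():
    """Load the Challenge test split; needs the third-party `datasets`
    package and network access, so the import is kept local."""
    from datasets import load_dataset
    return load_dataset("allenai/ai2_arc", "ARC-Challenge", split="test")


# Hand-written records in the same shape, NOT taken from the dataset.
# Labels are usually letters, but some records use digits instead,
# so comparisons here stay string-based.
SAMPLE: List[ArcRecord] = [
    {
        "id": "example-1",
        "question": "Which of these is a source of light?",
        "choices": {
            "text": ["the Sun", "a rock", "a shadow", "a cloud"],
            "label": ["A", "B", "C", "D"],
        },
        "answerKey": "A",
    },
    {
        "id": "example-2",
        "question": "Which tool measures temperature?",
        "choices": {
            "text": ["ruler", "thermometer", "balance", "stopwatch"],
            "label": ["1", "2", "3", "4"],
        },
        "answerKey": "2",
    },
]


def always_first(record: ArcRecord) -> str:
    """Trivial baseline: always pick the first listed choice."""
    return record["choices"]["label"][0]


if __name__ == "__main__":
    print(format_prompt(SAMPLE[0]))
    print(accuracy(SAMPLE, always_first))
```

To score a real model, `always_first` would be replaced with a function that sends `format_prompt(record)` to the model and returns the chosen label, and `SAMPLE` with the result of `load_arc_challenge_test()`.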

Index Score

76.2
Adoption: 90
Quality: 87
Freshness: 71
Citations: 91
Engagement: 0
