Datasetbenchmarksv1.0

ARC Dataset

by Allen Institute for AI · open-source · Last verified 2026-03-17

The AI2 Reasoning Challenge (ARC) dataset contains 7,787 grade 3–9 science exam questions split into Easy and Challenge partitions. The Challenge set contains questions that require deeper reasoning and world knowledge, making it a reliable signal for advanced language understanding.

https://huggingface.co/datasets/allenai/ai2_arc ↗

C+

C+—Average

Adoption: A+Quality: AFreshness: B+Citations: FEngagement: F

Specifications

License: CC-BY-SA-4.0
Pricing: open-source
Capabilities: science-evaluation, reasoning-benchmark, multiple-choice-qa
Integrations: huggingface-datasets, lm-eval-harness
Use Cases: model-evaluation, science-reasoning, benchmarking
API Available: No
Tags: benchmark, science-questions, multiple-choice, reasoning, ai2
Added: 2026-03-17
Completeness: 100%

Index Score

Adoption

Quality

Freshness

Citations

Engagement

Need this tool deployed for your team?

Get a Custom Setup

Explore the full AI ecosystem on Agents as a Service