Skip to main content
Paperethicsv1.0

Datasheets for Datasets

by Microsoft Research / Multiple Institutions · free · Last verified 2026-03-17

Drawing an analogy to electronics component datasheets, this paper proposes that every ML dataset should be accompanied by a standardized document covering its motivation, composition, collection process, preprocessing, uses, distribution, and maintenance. Datasheets for Datasets has become the foundational standard for dataset transparency and is widely required by major AI venues.

https://arxiv.org/abs/1803.09010
B+
B+Good
Adoption: AQuality: A+Freshness: C+Citations: A+Engagement: F

Specifications

License
Open Access
Pricing
free
Capabilities
dataset-documentation, data-transparency, provenance-tracking, governance-standardization
Integrations
Use Cases
responsible-ai, data-governance, dataset-auditing, bias-documentation
API Available
No
Tags
ethics, datasets, documentation, transparency, responsible-ai, data-governance
Added
2026-03-17
Completeness
100%

Index Score

75.2
Adoption
85
Quality
90
Freshness
55
Citations
93
Engagement
0

Put AI to work for your business

Deploy this paper alongside autonomous AaaS agents that handle tasks end-to-end — no babysitting required.

Explore the full AI ecosystem on Agents as a Service