Datasheets for Datasets
by Microsoft Research / Multiple Institutions · free · Last verified 2026-03-17
Drawing an analogy to electronics component datasheets, this paper proposes that every ML dataset should be accompanied by a standardized document covering its motivation, composition, collection process, preprocessing, uses, distribution, and maintenance. Datasheets for Datasets has become the foundational standard for dataset transparency and is widely required by major AI venues.
https://arxiv.org/abs/1803.09010 ↗B+
B+—Good
Adoption: AQuality: A+Freshness: C+Citations: A+Engagement: F
Specifications
- License
- Open Access
- Pricing
- free
- Capabilities
- dataset-documentation, data-transparency, provenance-tracking, governance-standardization
- Integrations
- Use Cases
- responsible-ai, data-governance, dataset-auditing, bias-documentation
- API Available
- No
- Tags
- ethics, datasets, documentation, transparency, responsible-ai, data-governance
- Added
- 2026-03-17
- Completeness
- 100%
Index Score
75.2Adoption
85
Quality
90
Freshness
55
Citations
93
Engagement
0
Put AI to work for your business
Deploy this paper alongside autonomous AaaS agents that handle tasks end-to-end — no babysitting required.