BioASQ Dataset
by BioASQ Consortium · free · Last verified 2026-03-17
The BioASQ dataset is a benchmark for biomedical semantic indexing and question answering. It contains thousands of expert-annotated questions (factoid, list, yes/no, summary) paired with relevant PubMed articles, concepts, and ideal answers, designed to train and evaluate advanced NLP systems in the medical domain.
http://www.bioasq.org ↗B
B—Above Average
Adoption: B+Quality: AFreshness: ACitations: AEngagement: F
Specifications
- License
- CC-BY-2.5
- Pricing
- free
- Capabilities
- biomedical question answering, document retrieval for medical queries, semantic concept indexing, extractive and abstractive summarization, factoid question answering, list question answering, yes/no question answering, training deep learning models for NLP, benchmarking information retrieval systems
- Integrations
- [object Object], [object Object], [object Object], [object Object]
- Use Cases
- [object Object], [object Object], [object Object], [object Object], [object Object]
- API Available
- No
- Tags
- biomedical-qa, question-answering, semantic-indexing, benchmark, nlp, information-retrieval, medical-nlp, text-mining, large-scale-dataset, natural-language-processing, pubmed
- Added
- 2026-03-17
- Completeness
- 0.6%
Index Score
66.2Adoption
72
Quality
87
Freshness
82
Citations
80
Engagement
0