Dataset · scientific · v2026.01

UniProt

by UniProt Consortium (EMBL-EBI / SIB / PIR) · free · Last verified 2026-03-17

UniProt (Universal Protein Resource) is a comprehensive, freely accessible database of protein sequences and functional information, maintained by a consortium of EMBL-EBI, SIB, and PIR. Its sequence archive, UniParc, holds over 250 million protein sequences, and the Swiss-Prot section of UniProtKB provides 570,000+ manually reviewed entries with expert-curated functional annotations. It is widely used as a training source for protein language models.

https://www.uniprot.org
Grade: A (Great)
Adoption: A+ · Quality: A+ · Freshness: A+ · Citations: A+ · Engagement: F

Specifications

License: CC BY 4.0
Pricing: free
Capabilities: protein-sequence-retrieval, functional-annotation, taxonomy-search
Integrations: biopython, alphafold
Use Cases: protein-language-model-training, drug-target-identification, functional-genomics
API Available: Yes
Tags: proteins, biology, sequences, functional-annotation, bioinformatics
Added: 2026-03-17
Completeness: 100%
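
The listing names an API and a protein-sequence-retrieval capability but shows no usage. The following is a minimal sketch, not an official client, of how a single entry might be retrieved as FASTA and parsed with the Python standard library. It assumes the public REST endpoint at rest.uniprot.org (not stated in this listing) and the usual UniProt FASTA header layout `>db|accession|entry_name description`, where `sp` marks a reviewed Swiss-Prot entry and `tr` an unreviewed TrEMBL entry. The helper names `fasta_url`, `parse_fasta`, and `fetch_entry` are hypothetical and chosen only for illustration.

```python
import urllib.request
from typing import Dict, List

# Assumption: base URL of the public UniProt REST service (not given in the listing).
REST_BASE = "https://rest.uniprot.org/uniprotkb"


def fasta_url(accession: str) -> str:
    """Build the FASTA download URL for one UniProtKB accession."""
    return f"{REST_BASE}/{accession}.fasta"


def parse_fasta(text: str) -> List[Dict[str, object]]:
    """Parse FASTA text into a list of records.

    Headers of the form '>db|accession|entry_name description' are split
    into their parts; any other header is kept whole as the accession.
    """
    records: List[Dict[str, object]] = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            ident, _, description = line[1:].partition(" ")
            parts = ident.split("|")
            if len(parts) == 3:
                db, accession, entry_name = parts
            else:
                db, accession, entry_name = "", ident, ""
            records.append({
                "db": db,
                "accession": accession,
                "entry_name": entry_name,
                "description": description,
                "reviewed": db == "sp",  # 'sp' = Swiss-Prot (manually reviewed)
                "sequence": "",
            })
        elif records:
            # Sequence lines are concatenated onto the most recent record.
            records[-1]["sequence"] += line
    return records


def fetch_entry(accession: str, timeout: float = 30.0) -> List[Dict[str, object]]:
    """Download one entry as FASTA and parse it (requires network access)."""
    with urllib.request.urlopen(fasta_url(accession), timeout=timeout) as resp:
        return parse_fasta(resp.read().decode("utf-8"))
```

Calling `fetch_entry` with a UniProtKB accession would return a one-element list whose `reviewed` flag indicates whether the entry comes from Swiss-Prot. `parse_fasta` does no network I/O, so it can also be applied to FASTA files downloaded in bulk.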

Index Score

Overall: 80.9
Adoption: 93
Quality: 97
Freshness: 92
Citations: 97
Engagement: 0
