UniProt
by UniProt Consortium (EMBL-EBI / SIB / PIR) · free · Last verified 2026-03-17
UniProt (Universal Protein Resource) is a comprehensive, freely accessible database of protein sequences and functional information, maintained by a consortium of EMBL-EBI, SIB, and PIR. It contains over 250 million protein sequences in UniParc, including 570,000+ manually reviewed Swiss-Prot entries with expert-curated functional annotations, and it is widely used as a training source for protein language models.
https://www.uniprot.org
Grade: A (Great)
Adoption: A+ · Quality: A+ · Freshness: A+ · Citations: A+ · Engagement: F
Specifications
- License: CC BY 4.0
- Pricing: free
- Capabilities: protein-sequence-retrieval, functional-annotation, taxonomy-search
- Integrations: biopython, alphafold
- Use Cases: protein-language-model-training, drug-target-identification, functional-genomics
- API Available: Yes
- Tags: proteins, biology, sequences, functional-annotation, bioinformatics
- Added: 2026-03-17
- Completeness: 100%
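As a rough illustration of the protein-sequence-retrieval capability and API access listed above, the sketch below builds request URLs for the public UniProt REST service at rest.uniprot.org and parses FASTA text with the Python standard library. The helper names (entry_fasta_url, search_url, parse_fasta, fetch_fasta) are hypothetical and not part of any UniProt client, and the sample record is made up for demonstration. Treat this as a minimal sketch under those assumptions rather than a reference client.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Base URL of the UniProtKB REST endpoint.
REST_BASE = "https://rest.uniprot.org/uniprotkb"


def entry_fasta_url(accession: str) -> str:
    """Return the URL that serves a single entry in FASTA format."""
    return f"{REST_BASE}/{accession}.fasta"


def search_url(query: str, size: int = 5) -> str:
    """Return a search URL that asks for FASTA results (hypothetical helper)."""
    params = {"query": query, "format": "fasta", "size": size}
    return f"{REST_BASE}/search?" + urlencode(params)


def parse_fasta(text: str) -> list[tuple[str, str]]:
    """Split FASTA text into (header, sequence) pairs."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def fetch_fasta(url: str, timeout: float = 30.0) -> str:
    """Download FASTA text from the given URL (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


# Offline demonstration with a made-up record, so no network call is needed.
sample = ">sp|X00000|DEMO_HUMAN Illustrative record\nMVLSPADKTN\nVKAAWGKV\n"
for header, sequence in parse_fasta(sample):
    print(header.split()[0], len(sequence))
```

To retrieve a real entry, one would pass a URL such as entry_fasta_url("P69905") (human hemoglobin subunit alpha) to fetch_fasta and feed the result to parse_fasta. Biopython, listed under Integrations, provides a fuller FASTA parser if the dependency is acceptable.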
Index Score: 80.9
- Adoption: 93
- Quality: 97
- Freshness: 92
- Citations: 97
- Engagement: 0