VoxCeleb2
by Oxford Visual Geometry Group (VGG) · free · Last verified 2026-03-17
VoxCeleb2 is a large-scale speaker recognition dataset containing over 1 million utterances from 6,112 celebrities extracted from YouTube videos in challenging real-world conditions. It is the standard benchmark for speaker verification and diarization research, providing naturalistic conversational speech at scale.
https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox2.html ↗B+
B+—Good
Adoption: AQuality: AFreshness: BCitations: AEngagement: F
Specifications
- License
- CC-BY-4.0
- Pricing
- free
- Capabilities
- speaker-verification, speaker-diarization, speaker-embedding
- Integrations
- SpeechBrain, pyannote.audio, ESPnet
- Use Cases
- speaker-recognition, model-training, benchmark
- API Available
- No
- Tags
- speaker-verification, speaker-recognition, in-the-wild, celebrity-speech
- Added
- 2026-03-17
- Completeness
- 100%
Index Score
73Adoption
84
Quality
88
Freshness
62
Citations
87
Engagement
0
Put AI to work for your business
Deploy this dataset alongside autonomous AaaS agents that handle tasks end-to-end — no babysitting required.