Dataset · Speech & Audio AI · v2.0

VoxCeleb2

by Oxford Visual Geometry Group (VGG) · free · Last verified 2026-03-17

VoxCeleb2 is a large-scale speaker recognition dataset containing over 1 million utterances from 6,112 celebrities, extracted from YouTube videos recorded in challenging real-world conditions. It is widely used for training and benchmarking speaker verification and diarization systems, and provides naturalistic conversational speech at scale.
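
To make the speaker/utterance structure concrete, here is a minimal sketch that groups utterance files by speaker and counts both. It assumes a local copy laid out as <speaker_id>/<video_id>/<utterance>.m4a; that layout, the function names, and the sample paths are illustrative assumptions, not something this listing specifies, so check them against the actual download.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def group_by_speaker(relative_paths):
    """Group utterance paths by speaker ID.

    Assumes each path has exactly three parts:
    <speaker_id>/<video_id>/<utterance file>. This layout is an
    assumption for illustration, not taken from the listing.
    """
    utterances = defaultdict(list)
    for raw in relative_paths:
        parts = PurePosixPath(raw).parts
        if len(parts) != 3:
            continue  # skip anything that does not match the assumed layout
        speaker_id, video_id, filename = parts
        utterances[speaker_id].append((video_id, filename))
    return dict(utterances)


def summarize(utterances):
    """Return (number of speakers, total number of utterances)."""
    return len(utterances), sum(len(items) for items in utterances.values())


# Hypothetical paths, used only to show the shape of the data.
sample_paths = [
    "spk_a/video_1/00001.m4a",
    "spk_a/video_1/00002.m4a",
    "spk_b/video_9/00001.m4a",
    "README.txt",
]
grouped = group_by_speaker(sample_paths)
speaker_count, utterance_count = summarize(grouped)
```

Run over a full local copy, the same two counts can be compared against the figures quoted above as a quick sanity check on the download.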

https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox2.html
B+ (Good)
Adoption: A · Quality: A · Freshness: B · Citations: A · Engagement: F

Specifications

License: CC-BY-4.0
Pricing: free
Capabilities: speaker-verification, speaker-diarization, speaker-embedding
Integrations: SpeechBrain, pyannote.audio, ESPnet
Use Cases: speaker-recognition, model-training, benchmark
API Available: No
Tags: speaker-verification, speaker-recognition, in-the-wild, celebrity-speech
Added: 2026-03-17
Completeness: 100%

Index Score

Overall: 73
Adoption: 84
Quality: 88
Freshness: 62
Citations: 87
Engagement: 0
