Skip to main content
PaperComputer Visionv1.0

DINOv2: Learning Robust Visual Features without Supervision

by Meta AI · open-source · Last verified 2026-03-17

Presented DINOv2, a self-supervised vision foundation model trained on a curated dataset of 142 million images using a combination of self-distillation and contrastive objectives. DINOv2 features serve as universal visual representations, excelling on depth estimation, segmentation, and classification without fine-tuning.

https://arxiv.org/abs/2304.07193
B+
B+Good
Adoption: AQuality: A+Freshness: ACitations: AEngagement: F

Specifications

License
Apache 2.0
Pricing
open-source
Capabilities
feature-extraction, image-classification, depth-estimation, segmentation
Integrations
huggingface
Use Cases
universal-visual-features, transfer-learning, downstream-vision-tasks
API Available
No
Tags
dinov2, self-supervised, vision-transformer, feature-extraction, pretraining
Added
2026-03-17
Completeness
100%

Index Score

73.1
Adoption
85
Quality
93
Freshness
82
Citations
82
Engagement
0

Put AI to work for your business

Deploy this paper alongside autonomous AaaS agents that handle tasks end-to-end — no babysitting required.

Explore the full AI ecosystem on Agents as a Service