PaperComputer Visionv1.0

DINOv2: Learning Robust Visual Features without Supervision

by Meta AI · open-source · Last verified 2026-03-17

Presented DINOv2, a self-supervised vision foundation model trained on a curated dataset of 142 million images using a combination of self-distillation and contrastive objectives. DINOv2 features serve as universal visual representations, excelling on depth estimation, segmentation, and classification without fine-tuning.

https://arxiv.org/abs/2304.07193 ↗

C+

C+—Average

Adoption: AQuality: A+Freshness: ACitations: FEngagement: F

Specifications

License: Apache 2.0
Pricing: open-source
Capabilities: feature-extraction, image-classification, depth-estimation, segmentation
Integrations: huggingface
Use Cases: universal-visual-features, transfer-learning, downstream-vision-tasks
API Available: No
Tags: dinov2, self-supervised, vision-transformer, feature-extraction, pretraining
Added: 2026-03-17
Completeness: 100%

Index Score

Adoption

Quality

Freshness

Citations

Engagement

Need this tool deployed for your team?

Get a Custom Setup

Explore the full AI ecosystem on Agents as a Service