facebookresearch/dinov2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/dinov2)

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

☆13,109

Alternatives and similar repositories for dinov2

Users that are interested in dinov2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / dino
View on GitHub
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
☆7,600Jul 3, 2024Updated 2 years ago
facebookresearch / segment-anything
View on GitHub
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…
☆54,550Sep 18, 2024Updated last year
IDEA-Research / Grounded-Segment-Anything
View on GitHub
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …
☆17,666Sep 5, 2024Updated last year
IDEA-Research / GroundingDINO
View on GitHub
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆10,394Aug 12, 2024Updated last year
facebookresearch / sam2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,533May 30, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆33,994Mar 25, 2026Updated 3 months ago
mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆13,986Updated this week
UX-Decoder / Segment-Everything-Everywhere-All-At-Once
View on GitHub
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
☆4,794Aug 19, 2024Updated last year
haotian-liu / LLaVA
View on GitHub
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,923Aug 12, 2024Updated last year
salesforce / LAVIS
View on GitHub
LAVIS - A One-stop Library for Language-Vision Intelligence
☆11,250Jun 2, 2026Updated last month
LiheYoung / Depth-Anything
View on GitHub
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
☆8,148Jul 17, 2024Updated 2 years ago
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,986Updated this week
facebookresearch / dinov3
View on GitHub
Reference PyTorch implementation and models for DINOv3
☆10,941Updated this week
facebookresearch / ImageBind
View on GitHub
ImageBind One Embedding Space to Bind Them All
☆9,056Nov 21, 2025Updated 7 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
facebookresearch / DiT
View on GitHub
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,674May 31, 2024Updated 2 years ago
facebookresearch / mae
View on GitHub
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
☆8,364Jul 23, 2024Updated last year
CASIA-LMC-Lab / FastSAM
View on GitHub
Fast Segment Anything
☆8,372Jul 30, 2024Updated last year
lllyasviel / ControlNet
View on GitHub
Let us control diffusion models!
☆34,000Feb 25, 2024Updated 2 years ago
huggingface / diffusers
View on GitHub
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
☆34,064Updated this week
baaivision / EVA
View on GitHub
EVA Series: Visual Representation Fantasies from BAAI
☆2,684Aug 1, 2024Updated last year
UX-Decoder / Semantic-SAM
View on GitHub
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,848Jul 10, 2025Updated last year
baaivision / Painter
View on GitHub
Painter & SegGPT Series: Vision Foundation Models from BAAI
☆2,593Dec 6, 2024Updated last year
SysCV / sam-hq
View on GitHub
Segment Anything in High Quality [NeurIPS 2023]
☆4,243Sep 12, 2025Updated 10 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
openai / consistency_models
View on GitHub
Official repo for consistency models.
☆6,491Mar 22, 2024Updated 2 years ago
lucidrains / vit-pytorch
View on GitHub
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,413Jun 22, 2026Updated 3 weeks ago
facebookresearch / Mask2Former
View on GitHub
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
☆3,408Jul 29, 2024Updated last year
facebookresearch / detr
View on GitHub
End-to-End Object Detection with Transformers
☆15,336Mar 12, 2024Updated 2 years ago
facebookresearch / detectron2
View on GitHub
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
☆34,599Jun 7, 2026Updated last month
microsoft / unilm
View on GitHub
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,159Jan 23, 2026Updated 5 months ago
ChaoningZhang / MobileSAM
View on GitHub
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
☆5,814May 5, 2026Updated 2 months ago
IDEA-Research / DINO
View on GitHub
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
☆2,825Jul 31, 2024Updated last year
microsoft / Swin-Transformer
View on GitHub
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆15,996Jul 24, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
naver / dust3r
View on GitHub
DUSt3R: Geometric 3D Vision Made Easy
☆7,242Sep 24, 2025Updated 9 months ago
Vision-CAIR / MiniGPT-4
View on GitHub
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,662Sep 2, 2024Updated last year
graphdeco-inria / gaussian-splatting
View on GitHub
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
☆22,659Oct 17, 2025Updated 9 months ago
CompVis / latent-diffusion
View on GitHub
High-Resolution Image Synthesis with Latent Diffusion Models
☆14,104Feb 29, 2024Updated 2 years ago
mhamilton723 / FeatUp
View on GitHub
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
☆1,652Jun 28, 2024Updated 2 years ago
DepthAnything / Depth-Anything-V2
View on GitHub
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
☆8,464Mar 24, 2026Updated 3 months ago
xinyu1205 / recognize-anything
View on GitHub
Open-source and strong foundation image recognition models.
☆3,688Feb 18, 2025Updated last year