facebookresearch/sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

☆19,630

Alternatives and similar repositories for sam2

Users who are interested in sam2 are comparing it to the repositories listed below.

facebookresearch / segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoi…
☆54,625 · Sep 18, 2024 · Updated last year
IDEA-Research / Grounded-SAM-2
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
☆3,667 · Nov 11, 2025 · Updated 8 months ago
facebookresearch / sam3
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading t…
☆11,145 · Updated this week
facebookresearch / dinov3
Reference PyTorch implementation and models for DINOv3
☆11,063 · Jul 15, 2026 · Updated 2 weeks ago
facebookresearch / dinov2
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,176 · Jun 3, 2026 · Updated last month
IDEA-Research / Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and …
☆17,687 · Sep 5, 2024 · Updated last year
IDEA-Research / GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆10,459 · Aug 12, 2024 · Updated last year
DepthAnything / Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
☆8,563 · Mar 24, 2026 · Updated 4 months ago
openai / CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆34,100 · Mar 25, 2026 · Updated 4 months ago
QwenLM / Qwen3-VL
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆19,694 · Jan 30, 2026 · Updated 6 months ago
facebookresearch / vggt
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
☆14,053 · May 19, 2026 · Updated 2 months ago
LiheYoung / Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
☆8,170 · Jul 17, 2024 · Updated 2 years ago
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,956 · Aug 12, 2024 · Updated last year
UX-Decoder / Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,854 · Jul 10, 2025 · Updated last year
yangchris11 / samurai
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
☆7,104 · Mar 18, 2025 · Updated last year
CASIA-LMC-Lab / FastSAM
Fast Segment Anything
☆8,383 · Jul 30, 2024 · Updated 2 years ago
AILab-CVC / YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
☆6,486 · Feb 26, 2025 · Updated last year
graphdeco-inria / gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
☆22,829 · Oct 17, 2025 · Updated 9 months ago
facebookresearch / co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
☆5,045 · Mar 3, 2026 · Updated 4 months ago
black-forest-labs / flux
Official inference repo for FLUX.1 models
☆25,832 · Jul 31, 2025 · Updated last year
SysCV / sam-hq
Segment Anything in High Quality [NeurIPS 2023]
☆4,248 · Sep 12, 2025 · Updated 10 months ago
huggingface / diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
☆34,173 · Updated this week
UX-Decoder / Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
☆4,794 · Aug 19, 2024 · Updated last year
ByteDance-Seed / Depth-Anything-3
Depth Anything 3
☆6,008 · Updated this week
ChaoningZhang / MobileSAM
This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
☆5,831 · May 5, 2026 · Updated 2 months ago
mlfoundations / open_clip
An open source implementation of CLIP.
☆14,034 · Jul 17, 2026 · Updated last week
facebookresearch / sam-3d-objects
SAM 3D Objects
☆7,199 · Jun 2, 2026 · Updated last month
facebookresearch / sapiens
High-resolution models for human tasks.
☆5,411 · May 26, 2026 · Updated 2 months ago
facebookresearch / perception_models
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,330 · Apr 13, 2026 · Updated 3 months ago
yformer / EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
☆2,487 · Dec 24, 2024 · Updated last year
naver / dust3r
DUSt3R: Geometric 3D Vision Made Easy
☆7,267 · Sep 24, 2025 · Updated 10 months ago
lllyasviel / ControlNet
Let us control diffusion models!
☆34,026 · Feb 25, 2024 · Updated 2 years ago
NVIDIA / cosmos
NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomou…
☆11,309 · Updated this week
facebookresearch / detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
☆34,638 · Updated this week
ultralytics / ultralytics
Ultralytics YOLO26, YOLO11, YOLOv8 — object detection, instance segmentation, semantic segmentation, image classification, pose estimatio…
☆60,054 · Updated this week
OpenGVLab / InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
☆10,111 · Sep 22, 2025 · Updated 10 months ago
Genesis-Embodied-AI / genesis-world
Simulation platform for general-purpose robotics & embodied AI learning.
☆29,667 · Updated this week
facebookresearch / DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,693 · May 31, 2024 · Updated 2 years ago
LLaVA-VL / LLaVA-NeXT
☆4,713 · Jun 15, 2026 · Updated last month