facebookresearch/sam3

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/sam3)

facebookresearch / sam3

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

☆11,063

Alternatives and similar repositories for sam3

Users that are interested in sam3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / dinov3
View on GitHub
Reference PyTorch implementation and models for DINOv3
☆10,993Jul 15, 2026Updated last week
facebookresearch / sam-3d-objects
View on GitHub
SAM 3D Objects
☆7,161Jun 2, 2026Updated last month
facebookresearch / sam2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,577May 30, 2026Updated last month
ByteDance-Seed / Depth-Anything-3
View on GitHub
Depth Anything 3
☆5,956Jul 15, 2026Updated last week
facebookresearch / sam-3d-body
View on GitHub
The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints …
☆3,381Feb 19, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
IDEA-Research / Grounded-SAM-2
View on GitHub
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
☆3,652Nov 11, 2025Updated 8 months ago
facebookresearch / vggt
View on GitHub
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
☆13,968May 19, 2026Updated 2 months ago
facebookresearch / segment-anything
View on GitHub
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…
☆54,590Sep 18, 2024Updated last year
roboflow / rf-detr
View on GitHub
RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning…
☆8,658Updated this week
facebookresearch / dinov2
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,150Jun 3, 2026Updated last month
facebookresearch / map-anything
View on GitHub
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
☆3,585Updated this week
QwenLM / Qwen3-VL
View on GitHub
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆19,650Jan 30, 2026Updated 5 months ago
facebookresearch / perception_models
View on GitHub
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,330Apr 13, 2026Updated 3 months ago
DepthAnything / Depth-Anything-V2
View on GitHub
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
☆8,521Mar 24, 2026Updated 3 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
IDEA-Research / Rex-Omni
View on GitHub
[CVPR2026] Detect Anything via Next Point Prediction
☆1,516Feb 22, 2026Updated 5 months ago
yyfz / Pi3
View on GitHub
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,084Jul 3, 2026Updated 2 weeks ago
IDEA-Research / GroundingDINO
View on GitHub
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆10,437Aug 12, 2024Updated last year
microsoft / MoGe
View on GitHub
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
☆2,672Updated this week
facebookresearch / vjepa2
View on GitHub
PyTorch code and models for VJEPA2 self-supervised learning from video.
☆4,387Mar 23, 2026Updated 4 months ago
NVlabs / FoundationStereo
View on GitHub
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
☆2,834Dec 19, 2025Updated 7 months ago
nv-tlabs / vipe
View on GitHub
ViPE: Video Pose Engine for Geometric 3D Perception
☆2,049Jun 9, 2026Updated last month
Intellindust-AI-Lab / DEIMv2
View on GitHub
[DEIMv2] Real Time Object Detection Meets DINOv3
☆1,942Mar 24, 2026Updated 4 months ago
NVlabs / RADIO
View on GitHub
Official repository for "AM-RADIO: Reduce All Domains Into One"
☆1,900May 29, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
IDEA-Research / Grounded-Segment-Anything
View on GitHub
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …
☆17,686Sep 5, 2024Updated last year
DepthAnything / Video-Depth-Anything
View on GitHub
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
☆2,005Oct 7, 2025Updated 9 months ago
bytedance / Sa2VA
View on GitHub
Official Repo For Pixel-LLM Codebase: Sa2VA (Arxiv-25), SAMTok (CVPR-26), VRT, SaSaSa2VA (1-st solution for LSVOS)
☆1,650Jun 19, 2026Updated last month
THU-MIG / yoloe
View on GitHub
YOLOE: Real-Time Seeing Anything [ICCV 2025]
☆2,214Jun 26, 2025Updated last year
nerfstudio-project / gsplat
View on GitHub
CUDA accelerated rasterization of gaussian splatting
☆5,438Updated this week
facebookresearch / vggt-omega
View on GitHub
[CVPR 2026 Oral] VGGT Omega
☆3,644Jul 15, 2026Updated last week
NVIDIA / cosmos
View on GitHub
NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomou…
☆11,212Updated this week
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆34,054Mar 25, 2026Updated 3 months ago
CVHub520 / X-AnyLabeling
View on GitHub
Effortless data labeling with AI support from Segment Anything and other awesome models.
☆9,854Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
QwenLM / Qwen-Image
View on GitHub
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
☆8,158Feb 10, 2026Updated 5 months ago
henry123-boy / SpaTrackerV2
View on GitHub
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆984Feb 27, 2026Updated 4 months ago
Robbyant / lingbot-vision
View on GitHub
Self-supervised learning for spatial perception
☆850Jul 8, 2026Updated 2 weeks ago
CUT3R / CUT3R
View on GitHub
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,468Aug 27, 2025Updated 10 months ago
graphdeco-inria / gaussian-splatting
View on GitHub
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
☆22,755Oct 17, 2025Updated 9 months ago
Wan-Video / Wan2.2
View on GitHub
Wan: Open and Advanced Large-Scale Video Generative Models
☆16,812Mar 17, 2026Updated 4 months ago
facebookresearch / EUPE
View on GitHub
Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized…
☆690Apr 14, 2026Updated 3 months ago