IDEA-Research/Grounded-Segment-Anything

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IDEA-Research/Grounded-Segment-Anything)

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

☆17,666

Alternatives and similar repositories for Grounded-Segment-Anything

Users that are interested in Grounded-Segment-Anything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IDEA-Research / GroundingDINO
View on GitHub
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
☆10,394Aug 12, 2024Updated last year
facebookresearch / segment-anything
View on GitHub
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…
☆54,550Sep 18, 2024Updated last year
UX-Decoder / Segment-Everything-Everywhere-All-At-Once
View on GitHub
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
☆4,794Aug 19, 2024Updated last year
UX-Decoder / Semantic-SAM
View on GitHub
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,848Jul 10, 2025Updated last year
facebookresearch / dinov2
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆13,109Jun 3, 2026Updated last month
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
CASIA-LMC-Lab / FastSAM
View on GitHub
Fast Segment Anything
☆8,372Jul 30, 2024Updated last year
xinyu1205 / recognize-anything
View on GitHub
Open-source and strong foundation image recognition models.
☆3,688Feb 18, 2025Updated last year
SysCV / sam-hq
View on GitHub
Segment Anything in High Quality [NeurIPS 2023]
☆4,243Sep 12, 2025Updated 10 months ago
IDEA-Research / Grounded-SAM-2
View on GitHub
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
☆3,638Nov 11, 2025Updated 8 months ago
fudan-zvg / Semantic-Segment-Anything
View on GitHub
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
☆2,303Jun 7, 2023Updated 3 years ago
facebookresearch / sam2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,533May 30, 2026Updated last month
haotian-liu / LLaVA
View on GitHub
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,923Aug 12, 2024Updated last year
salesforce / LAVIS
View on GitHub
LAVIS - A One-stop Library for Language-Vision Intelligence
☆11,250Jun 2, 2026Updated last month
ChaoningZhang / MobileSAM
View on GitHub
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
☆5,814May 5, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
geekyutao / Inpaint-Anything
View on GitHub
Inpaint anything using Segment Anything and inpainting models.
☆7,655Feb 29, 2024Updated 2 years ago
lllyasviel / ControlNet
View on GitHub
Let us control diffusion models!
☆34,000Feb 25, 2024Updated 2 years ago
gaomingqi / Track-Anything
View on GitHub
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI…
☆6,978Dec 13, 2025Updated 7 months ago
mlfoundations / open_clip
View on GitHub
An open source implementation of CLIP.
☆13,986Updated this week
baaivision / Painter
View on GitHub
Painter & SegGPT Series: Vision Foundation Models from BAAI
☆2,593Dec 6, 2024Updated last year
openai / CLIP
View on GitHub
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
☆33,994Mar 25, 2026Updated 3 months ago
sail-sg / EditAnything
View on GitHub
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
☆3,426Feb 23, 2025Updated last year
huggingface / diffusers
View on GitHub
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
☆34,064Updated this week
microsoft / GLIP
View on GitHub
Grounded Language-Image Pre-training
☆2,604Jan 24, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
LiheYoung / Depth-Anything
View on GitHub
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
☆8,148Jul 17, 2024Updated 2 years ago
Vision-CAIR / MiniGPT-4
View on GitHub
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,662Sep 2, 2024Updated last year
VainF / Awesome-Anything
View on GitHub
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
☆1,850Nov 15, 2023Updated 2 years ago
huggingface / pytorch-image-models
View on GitHub
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…
☆36,986Updated this week
AILab-CVC / YOLO-World
View on GitHub
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
☆6,466Feb 26, 2025Updated last year
facebookresearch / ImageBind
View on GitHub
ImageBind One Embedding Space to Bind Them All
☆9,056Nov 21, 2025Updated 7 months ago
open-mmlab / mmdetection
View on GitHub
OpenMMLab Detection Toolbox and Benchmark
☆32,813Aug 21, 2024Updated last year
z-x-yang / Segment-and-Track-Anything
View on GitHub
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary alg…
☆3,132Jul 3, 2026Updated 2 weeks ago
luca-medeiros / lang-segment-anything
View on GitHub
SAM with text prompt
☆2,592Aug 28, 2025Updated 10 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yformer / EfficientSAM
View on GitHub
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
☆2,485Dec 24, 2024Updated last year
facebookresearch / detectron2
View on GitHub
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
☆34,599Jun 7, 2026Updated last month
ZrrSkywalker / Personalize-SAM
View on GitHub
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
☆1,665Jul 22, 2024Updated last year
openai / consistency_models
View on GitHub
Official repo for consistency models.
☆6,491Mar 22, 2024Updated 2 years ago
microsoft / X-Decoder
View on GitHub
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
☆1,346Oct 5, 2023Updated 2 years ago
BradyFU / Awesome-Multimodal-Large-Language-Models
View on GitHub
Latest Advances on Multimodal Large Language Models
☆17,947Jul 2, 2026Updated 2 weeks ago
microsoft / unilm
View on GitHub
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,159Jan 23, 2026Updated 5 months ago