motern88 / Det-SAM2
A detection model supplies bounding boxes as conditional prompts in the SAM2 video predictor pipeline
☆53 Updated 11 months ago
Alternatives and similar repositories for Det-SAM2
Users that are interested in Det-SAM2 are comparing it to the libraries listed below
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2) in real-time. ☆55 Updated last year
- Grounded Tracking for Streaming Videos ☆123 Updated last year
- Muggled SAM: Segmentation without the magic ☆175 Updated this week
- [AAAI 2026] Code for "SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation". ☆153 Updated 3 weeks ago
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2" ☆446 Updated last month
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything ☆265 Updated 8 months ago
- Run Segment Anything Model 2 on a live video stream ☆548 Updated 6 months ago
- This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously impro… ☆131 Updated 2 months ago
- ☆39 Updated 3 months ago
- [IROS 2025] NIDS-Net: A unified framework for novel instance detection and segmentation ☆71 Updated 7 months ago
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024) ☆533 Updated last year
- YOLOv8 model with Meta's SAM ☆142 Updated 2 years ago
- SSA + FastSAM/Semantic Fast Segment Anything, or Fast Semantic Segment Anything ☆114 Updated last week
- X-SAM: From Segment Anything to Any Segmentation (AAAI2026) ☆330 Updated 2 weeks ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025) ☆212 Updated last month
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting. ☆287 Updated 5 months ago
- YOLO-World + EfficientViT SAM ☆106 Updated last year
- Using CLIP and SAM to segment any instance you specify with a text prompt of instance names ☆182 Updated 2 years ago
- The Missing Point in Vision Transformers for Universal Image Segmentation ☆55 Updated last month
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model" ☆488 Updated 8 months ago
- ☆130 Updated last year
- ☆76 Updated 8 months ago
- [ICLR'24 & IJCV'25] Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching ☆538 Updated last week
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆172 Updated 2 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆134 Updated last year
- Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such… ☆265 Updated 2 years ago
- ☆76 Updated 9 months ago
- ☆87 Updated 11 months ago
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3 ☆268 Updated last year
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024. ☆54 Updated last year