motern88 / Det-SAM2Links
Det-Model offer bbox as conditional prompt in SAM2 video predictor Pipeline
☆53Updated 10 months ago
Alternatives and similar repositories for Det-SAM2
Users that are interested in Det-SAM2 are comparing it to the libraries listed below
Sorting:
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2) in real-time.☆54Updated 11 months ago
- Grounded Tracking for Streaming Videos☆123Updated last year
- [AAAI 2026] Code for "SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation".☆149Updated last week
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆440Updated last month
- Run Segment Anything Model 2 on a live video stream☆540Updated 5 months ago
- Muggled SAM: Segmentation without the magic☆171Updated 2 weeks ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆266Updated 7 months ago
- This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously impro…☆124Updated last month
- The Missing Point in Vision Transformers for Universal Image Segmentation☆55Updated 2 weeks ago
- [IROS 2025] NIDS-Net: A unified framework for novel instance detection and segmentation☆70Updated 6 months ago
- yolov8 model with SAM meta☆142Updated 2 years ago
- Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024)☆531Updated last year
- X-SAM: From Segment Anything to Any Segmentation (AAAI2026)☆324Updated last week
- YOLO-World + EfficientViT SAM☆106Updated last year
- ☆38Updated 3 months ago
- ☆130Updated last year
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆489Updated 8 months ago
- AutoTrackAnything is a universal, flexible and interactive tool for insane automatic object tracking over thousands of frames. It is deve…☆89Updated last year
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆172Updated last month
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025)☆212Updated 2 weeks ago
- 🏄 [ICLR 2025] OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer☆80Updated 3 months ago
- A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]☆380Updated last year
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆347Updated 2 months ago
- DVIS: Decoupled Video Instance Segmentation Framework☆155Updated last year
- SSA + FastSAM/Semantic Fast Segment Anything , or Fast Semantic Segment Anything☆113Updated 5 months ago
- Official code for NetTrack [CVPR 2024]☆109Updated last year
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.☆287Updated 5 months ago
- OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆379Updated 8 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆59Updated 9 months ago
- [CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).☆490Updated last month