patrick-tssn/Streaming-Grounded-SAM-2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/patrick-tssn/Streaming-Grounded-SAM-2)

patrick-tssn / Streaming-Grounded-SAM-2

Grounded Tracking for Streaming Videos

☆127

Alternatives and similar repositories for Streaming-Grounded-SAM-2

Users that are interested in Streaming-Grounded-SAM-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Gy920 / segment-anything-2-real-time
View on GitHub
Run Segment Anything Model 2 on a live video stream
☆593Jun 3, 2025Updated last year
ShuoShenDe / Grounded-Sam2-Tracking
View on GitHub
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and …
☆13Aug 29, 2024Updated last year
IDEA-Research / Grounded-SAM-2
View on GitHub
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
☆3,655Nov 11, 2025Updated 8 months ago
heyoeyo / muggled_sam
View on GitHub
Muggled SAM: Segmentation without the magic
☆261Jun 26, 2026Updated 3 weeks ago
khw11044 / SAM2_streaming
View on GitHub
☆24Jun 19, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
robrosinc / REALTIME_SAM2
View on GitHub
☆41Feb 26, 2026Updated 5 months ago
urbste / nanosam2
View on GitHub
Tools to distill the Hiera transformer backbone to CNNs that are easier to deploy on the edge.
☆16Dec 4, 2024Updated last year
zch0414 / p2sam
View on GitHub
Part-aware Prompted Segment Anything Model for Adaptive Segmentation [TMLR 2025]
☆11Feb 19, 2026Updated 5 months ago
ZhangDailing8 / CPDTrack
View on GitHub
☆18Feb 8, 2026Updated 5 months ago
JiaqiLi404 / Super_resolution_DINO
View on GitHub
The application of large pre-trained vision model DINOv2 from MetaAI for feature points matching, and a ViT decoder used for Auto Encoder
☆18Apr 27, 2023Updated 3 years ago
ailia-ai / segment-anything-2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆73Mar 30, 2026Updated 3 months ago
MyrnaCCS / contrastive-gaussian-clustering
View on GitHub
Code release for Contrastive Gaussian Clustering (CGC), a method for zero-shot 3D scene segmentation.
☆16Aug 8, 2024Updated last year
nuomizai / T2VLM
View on GitHub
[ICCV'25] T2 -VLM: Training-Free Generation of Temporally Consistent Rewards from VLMs
☆16Jul 8, 2025Updated last year
franciszchen / SCA-Net
View on GitHub
☆10Oct 7, 2023Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
BCV-Uniandes / TAPIR
View on GitHub
☆38Apr 5, 2025Updated last year
zhifanzhu / getagrip
View on GitHub
☆34Dec 4, 2025Updated 7 months ago
lalithjets / Global-reasoned-multi-task-model
View on GitHub
Globally reasoned multi-task model for surgical scene understanding. A multi-task model for segmentation and scene graph. Offical Impleme…
☆15May 5, 2022Updated 4 years ago
OrigamiSL / OTETrack
View on GitHub
Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking
☆11Sep 3, 2024Updated last year
eshoyuan / TrackGPT
View on GitHub
TrackGPT: Track What You Need in Videos via Text Prompts
☆25May 16, 2023Updated 3 years ago
NVlabs / sds-complete
View on GitHub
☆80Sep 8, 2024Updated last year
Runsong123 / PCF-Lift
View on GitHub
Code Release for ECCV 2024, "PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion"
☆21Mar 23, 2025Updated last year
TRI-ML / OctMAE
View on GitHub
Zero-Shot Multi-Object Shape Completion (ECCV 2024)
☆31Apr 1, 2025Updated last year
XiaokunFeng / CTVLT
View on GitHub
[ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues
☆19Dec 31, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
qizekun / SoFar
View on GitHub
[NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
☆245Jun 30, 2025Updated last year
XuMengyaAmy / CIDACaptioning
View on GitHub
☆17Jul 5, 2021Updated 5 years ago
LanternW / Minecraft-GridMap-Gene
View on GitHub
a project of FastLab
☆12Oct 30, 2023Updated 2 years ago
YvesXu / panoptic-segmentation-paper-list
View on GitHub
A paper list of panoptic segmentation using deep learning
☆12Sep 5, 2021Updated 4 years ago
tomasalex / aerosol
View on GitHub
Aerosol Optical Depth Statistical Analysis
☆11Jun 1, 2016Updated 10 years ago
shanglidan / under_water
View on GitHub
和鲸社区Kesci 水下目标检测算法赛（声学图像赛项）a榜 top3 b榜 top9
☆14Jun 10, 2020Updated 6 years ago
983632847 / SAM-for-Videos
View on GitHub
This repository is for the first survey on SAM & SAM2 for Videos.
☆53Apr 29, 2025Updated last year
caopulan / Mask2Former-LT
View on GitHub
[TMM2024] Official code of "Frequency-based Matcher for Long-tailed Semantic Segmentation".
☆13Jun 3, 2024Updated 2 years ago
Krying / PD_SSL_ZOO
View on GitHub
☆18Jun 3, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
nirlipo / ltl2pddl
View on GitHub
LTL2PDDL tool
☆13Jul 7, 2017Updated 9 years ago
cai4cai / cholec_instance_seg
View on GitHub
CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery
☆16Dec 18, 2025Updated 7 months ago
z-x-yang / Segment-and-Track-Anything
View on GitHub
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary alg…
☆3,134Jul 3, 2026Updated 3 weeks ago
pablovela5620 / sam2-depthanything
View on GitHub
☆80Apr 14, 2025Updated last year
MM-FIRE / FIRE
View on GitHub
☆13Nov 5, 2024Updated last year
xuxw98 / ESAM
View on GitHub
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
☆634May 7, 2025Updated last year
nuomizai / HIL-RL
View on GitHub
The official repository for the paper "Real-world Reinforcement Learning from Suboptimal Interventions”.
☆58Jul 3, 2026Updated 3 weeks ago