rongfu-dsb/MPG-SAM2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rongfu-dsb/MPG-SAM2)

rongfu-dsb / MPG-SAM2

[ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation

☆23

Alternatives and similar repositories for MPG-SAM2

Users that are interested in MPG-SAM2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GLUS-video / GLUS
View on GitHub
[CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…
☆70Jun 23, 2025Updated last year
GeWu-Lab / Ref-AVS
View on GitHub
The official repo for "Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes", ECCV 2024
☆50Oct 12, 2025Updated 9 months ago
GeWu-Lab / Stepping-Stones
View on GitHub
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18Oct 11, 2024Updated last year
cilinyan / ReVOS-api
View on GitHub
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆22Jul 20, 2024Updated 2 years ago
SitongGong / Veason-R1
View on GitHub
Official code of Veason-R1
☆15Jul 14, 2026Updated 2 weeks ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
buxiangzhiren / VD-IT
View on GitHub
Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024
☆48Sep 28, 2024Updated last year
LinfengYuan1997 / LoSh
View on GitHub
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13Jun 17, 2024Updated 2 years ago
cjhing / OS-FPI
View on GitHub
Official repository of OS-FPI
☆17Dec 22, 2024Updated last year
DanielSHKao / CoT-RVS
View on GitHub
[ICLR 2026] Official implementation for CoT-RVS
☆24Mar 17, 2026Updated 4 months ago
ClaudiaCuttano / SAMWISE
View on GitHub
[CVPR 2025 Highlight] "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆386Sep 25, 2025Updated 10 months ago
gaomingqi / Awesome-Video-Object-Segmentation
View on GitHub
🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects.
☆515Jul 13, 2026Updated 2 weeks ago
iSEE-Laboratory / ReferDINO
View on GitHub
(ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
☆142Nov 14, 2025Updated 8 months ago
wysnzzzz / DIT
View on GitHub
☆18Nov 15, 2024Updated last year
Dmmm1997 / PropVG
View on GitHub
[ICCV2025] PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
☆32Oct 13, 2025Updated 9 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rongfu-dsb / RS2-SAM2
View on GitHub
[AAAI 2026] RS2-SAM2: Customized SAM2 for Referring Remote Sensing Image Segmentation
☆23Feb 22, 2026Updated 5 months ago
Dmmm1997 / MomentSeg
View on GitHub
[ECCV2026] MomentSeg: Moment-Centric Sampling for Enhanced Video Pixel Understanding
☆24Jun 19, 2026Updated last month
cvlab-kaist / SOLA
View on GitHub
Official implementation of "Referring Video Object Segmentation via Language Aligned Track Selection".
☆41Jun 2, 2025Updated last year
dengandong / GroundMoRe
View on GitHub
☆18May 18, 2026Updated 2 months ago
bo-miao / HTR
View on GitHub
[TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory
☆19Apr 9, 2025Updated last year
yxchng / mask-grounding
View on GitHub
[CVPR2024] Mask Grounding for Referring Image Segmentation
☆29Jul 22, 2024Updated 2 years ago
ruohaoguo / avis
View on GitHub
[CVPR 2025] 🔥 Official impl. of "Audio-Visual Instance Segmentation".
☆52Jun 5, 2025Updated last year
linhuixiao / OneRef
View on GitHub
[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆32Nov 13, 2025Updated 8 months ago
sydai / referring-expression-counting
View on GitHub
☆28Feb 21, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Dmmm1997 / C3VG
View on GitHub
[AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
☆45Jul 2, 2025Updated last year
SuleBai / SC-CLIP
View on GitHub
[TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
☆73Mar 27, 2026Updated 4 months ago
iSEE-Laboratory / Long_RVOS
View on GitHub
(CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
☆37Feb 28, 2026Updated 5 months ago
yannqi / COMBO-AVS
View on GitHub
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…
☆40Apr 20, 2025Updated last year
PJLallen / InstanceSAM2Eval
View on GitHub
Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation
☆19Nov 12, 2025Updated 8 months ago
RobertLuo1 / iccv2023_RVOS_Challenge
View on GitHub
[ICCV 2023 Workshop] The Official Implementation of The First Prize Solution for RVOS Competition
☆14Jan 1, 2024Updated 2 years ago
jasongief / TGS-Agent
View on GitHub
[2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation
☆20Nov 8, 2025Updated 8 months ago
wudongming97 / OnlineRefer
View on GitHub
[ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
☆58Oct 7, 2023Updated 2 years ago
YuanJiayuuu / SWA-PF
View on GitHub
☆31Sep 22, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
chenwei746 / EEVG
View on GitHub
☆23Aug 20, 2024Updated last year
Sephirex-X / ADNet
View on GitHub
[ICCV 2023] ADNet: Lane Shape Prediction via Anchor Decomposition
☆37Oct 11, 2023Updated 2 years ago
henghuiding / Awesome-Multimodal-Referring-Segmentation
View on GitHub
[IJCV 2026] Multimodal Referring Segmentation
☆255Jun 30, 2026Updated 3 weeks ago
YichuXu / MambaMoE
View on GitHub
[Information Fusion 2025] MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image Classification
☆50Mar 21, 2026Updated 4 months ago
FudanCVL / OmniAVS
View on GitHub
[ICCV 2025] Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation
☆91Sep 29, 2025Updated 10 months ago
Dmmm1997 / InstanceVG
View on GitHub
[TPAMI2025] Improving Generalized Visual Grounding with Instance-aware Joint Learning
☆33Apr 28, 2026Updated 3 months ago
iSEE-Laboratory / Seg-ReSearch
View on GitHub
(ICML 2026) Seg-ReSearch: Segmentation with Interleaved Reasoning and External Search
☆49May 1, 2026Updated 2 months ago