A list of referring video object segmentation papers
☆57 Jun 6, 2025 Updated 9 months ago
Alternatives and similar repositories for Awesome-Referring-Video-Object-Segmentation
Users who are interested in Awesome-Referring-Video-Object-Segmentation are comparing it to the repositories listed below
- Awesome video instance segmentation papers ☆51 Dec 17, 2025 Updated 2 months ago
- Learning Better Video Query with SAM for Video Instance Segmentation (TCSVT 2024) ☆26 Apr 2, 2024 Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model ☆19 Jul 20, 2024 Updated last year
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation ☆85 Jul 24, 2024 Updated last year
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024 ☆47 Sep 28, 2024 Updated last year
- ☆18 Jun 6, 2025 Updated 9 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta… ☆66 Jun 23, 2025 Updated 8 months ago
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation ☆31 Dec 4, 2024 Updated last year
- 🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects. ☆468 Feb 18, 2026 Updated 2 weeks ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation ☆13 Jun 17, 2024 Updated last year
- [CVPR 2021] FMO Deblurring Benchmark ☆13 Jan 12, 2022 Updated 4 years ago
- Multimodal Referring Segmentation ☆219 Jan 22, 2026 Updated last month
- High Quality Video Reasoning Segmentation ☆146 Nov 24, 2025 Updated 3 months ago
- ☆11 Mar 11, 2025 Updated 11 months ago
- Video Reasoning Segmentation ☆28 Nov 29, 2024 Updated last year
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023] ☆28 Nov 28, 2024 Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023] ☆30 Mar 13, 2024 Updated last year
- [ICCV 2023] Spectrum-guided Multi-granularity Referring Video Object Segmentation. ☆111 Apr 9, 2025 Updated 10 months ago
- All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment ☆19 Feb 11, 2025 Updated last year
- [AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video… ☆91 Dec 23, 2024 Updated last year
- A collection of papers about Referring Image Segmentation. ☆809 Jan 28, 2026 Updated last month
- [ECCV 2024] Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation ☆35 Jan 6, 2025 Updated last year
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation ☆33 Mar 16, 2024 Updated last year
- [CVPR 2024 Challenge] 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation ☆32 Oct 18, 2024 Updated last year
- A list of video object segmentation (VOS) papers ☆307 Oct 22, 2025 Updated 4 months ago
- ☆135 Jul 4, 2024 Updated last year
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models ☆108 May 29, 2025 Updated 9 months ago
- Large-Vocabulary Video Instance Segmentation dataset ☆96 Jul 5, 2024 Updated last year
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024 ☆18 Oct 11, 2024 Updated last year
- Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation ☆18 Nov 12, 2025 Updated 3 months ago
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM) ☆38 Apr 7, 2023 Updated 2 years ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation ☆80 Oct 22, 2025 Updated 4 months ago
- Tracking with Human-Intent Reasoning ☆76 Nov 4, 2024 Updated last year
- 「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation ☆83 Jun 13, 2025 Updated 8 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation ☆49 Sep 24, 2024 Updated last year
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations ☆129 Nov 14, 2025 Updated 3 months ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation ☆87 Sep 8, 2025 Updated 5 months ago
- [MICCAI 2025] Official code implementation for paper: ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tra… ☆37 Nov 4, 2025 Updated 4 months ago
- This repository is for the first survey on SAM & SAM2 for Videos. ☆53 Apr 29, 2025 Updated 10 months ago