iSEE-Laboratory / ReferDINO
The official implementation of the paper "ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations".
☆40 Updated 4 months ago
Alternatives and similar repositories for ReferDINO
Users who are interested in ReferDINO are comparing it to the repositories listed below.
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025] ☆46 Updated 3 months ago
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation" ☆186 Updated last week
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model ☆64 Updated 4 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight) ☆43 Updated last month
- Official implementation of "Seurat: From Moving Points to Depth", CVPR 2025 Highlight ☆50 Updated last month
- Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation ☆65 Updated 2 months ago
- ☆71 Updated last month
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆113 Updated 10 months ago
- CAVIS: Context-Aware Video Instance Segmentation ☆86 Updated last month
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory ☆57 Updated 3 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024 ☆38 Updated 5 months ago
- Official implementation of "URECA : Unique Region Caption Anything" ☆46 Updated last month
- ☆21 Updated 3 weeks ago
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation" ☆27 Updated last month
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024. ☆50 Updated 7 months ago
- This repository is for the first survey on SAM & SAM2 for Videos. ☆49 Updated last month
- [CVPR 2025] Official code for Using Diffusion Priors for Video Amodal Segmentation ☆73 Updated last month
- ☆79 Updated 4 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of … ☆68 Updated last year
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆109 Updated 2 months ago
- Official implementation of "Exploring Temporally-Aware Features for Point Tracking" (CVPR 2025) ☆77 Updated 2 months ago
- ☆47 Updated 11 months ago
- ☆43 Updated 8 months ago
- Harnessing CLIP, DINO and SAM for Open Vocabulary Segmentation ☆58 Updated 3 months ago
- Official Code For Track Everything Everywhere Fast and Robustly ☆66 Updated 2 months ago
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6 ☆10 Updated 7 months ago
- ☆30 Updated 2 weeks ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024) ☆173 Updated last month
- ☆15 Updated 3 months ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models ☆84 Updated last week