iSEE-Laboratory/ReferDINO

(ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations

☆142

Alternatives and similar repositories for ReferDINO

Users who are interested in ReferDINO are comparing it to the repositories listed below.

iSEE-Laboratory / Long_RVOS
(CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
☆37 · Feb 28, 2026 · Updated 4 months ago
iSEE-Laboratory / Refer-Agent
[CVPR 2026] Refer-Agent: A Collaborative Multi-Agent System with Reasoning and Reflection for Referring Video Object Segmentation
☆35 · Mar 12, 2026 · Updated 4 months ago
iSEE-Laboratory / Seg-ReSearch
(ICML 2026) Seg-ReSearch: Segmentation with Interleaved Reasoning and External Search
☆48 · May 1, 2026 · Updated 2 months ago
GLUS-video / GLUS
[CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…
☆70 · Jun 23, 2025 · Updated last year
iSEE-Laboratory / TypeTele
[CoRL2025] Official repository of paper "TypeTele: Releasing Dexterity in Teleoperation by Dexterous Manipulation Types".
☆28 · Dec 3, 2025 · Updated 7 months ago
rongfu-dsb / MPG-SAM2
[ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation
☆23 · Sep 5, 2025 · Updated 10 months ago
iSEE-Laboratory / CycleManip
[CVPR2026] Official repository of paper "CycleManip: Enabling Cyclic Task Manipulation via Effective Historical Perception and Understand…
☆25 · Feb 21, 2026 · Updated 5 months ago
SitongGong / Veason-R1
Official code of Veason-R1
☆15 · Jul 14, 2026 · Updated last week
iSEE-Laboratory / Frozen-DETR
(NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"
☆34 · Mar 22, 2025 · Updated last year
ClaudiaCuttano / SAMWISE
[CVPR 2025 Highlight] "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"
☆386 · Sep 25, 2025 · Updated 10 months ago
HumanMLLM / IRG-MotionLLM
(ECCV2026) Official repository of paper "IRG-MotionLLM: Interleaving Motion Generation, Assessment and Refinement for Text-to-Motion Gene…
☆30 · Jul 1, 2026 · Updated 3 weeks ago
heshuting555 / DsHmp
[CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation
☆83 · Jul 24, 2024 · Updated 2 years ago
Tavarich / Awesome-Referring-Video-Object-Segmentation
A list of referring video object segmentation papers
☆63 · Jun 28, 2026 · Updated 3 weeks ago
Visual-AI / Pancap
[NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text
☆38 · Jan 31, 2026 · Updated 5 months ago
iSEE-Laboratory / DIF-of-Bimanual-Robotic-Manipulation
(ICCV 2025) Official repository of paper "Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework"
☆15 · Oct 15, 2025 · Updated 9 months ago
buxiangzhiren / VD-IT
Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024
☆48 · Sep 28, 2024 · Updated last year
iSEE-Laboratory / PanoDecouple
(CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"
☆19 · May 29, 2025 · Updated last year
cilinyan / ReVOS-api
[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model
☆22 · Jul 20, 2024 · Updated 2 years ago
gaomingqi / Awesome-Video-Object-Segmentation
🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects.
☆515 · Jul 13, 2026 · Updated last week
Seung-Hun-Lee / CAVIS
Official code for CAVIS: Context-Aware Video Instance Segmentation
☆116 · Sep 17, 2025 · Updated 10 months ago
IDEA-Research / RexSeek
[ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark
☆184 · Oct 15, 2025 · Updated 9 months ago
HumanMLLM / LOVE-R1
Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"
☆24 · Nov 1, 2025 · Updated 8 months ago
ymq2017 / entitysam
[CVPR'2025] EntitySAM: Segment Everything in Video
☆67 · Jul 13, 2025 · Updated last year
FudanCVL / AVI-Bench
[ICML'26] Toward Human-like Audio-Visual Intelligence of Omni-MLLMs
☆16 · Jun 20, 2026 · Updated last month
Hectormxy / OP-SAM
The official implementation of ICCV 25 OP-SAM "One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Ite…
☆15 · Jul 9, 2025 · Updated last year
dvl-tum / DynOMo
Official code of DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction (3DV 2025)
☆174 · Jan 29, 2025 · Updated last year
iSEE-Laboratory / EgoExo-Fitness
(ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"
☆38 · Apr 8, 2025 · Updated last year
MCG-NJU / SAM2-Plus
SAM 2++: Tracking Anything at Any Granularity
☆70 · Dec 15, 2025 · Updated 7 months ago
ByteDance-Seed / DATAMASK
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning
☆21 · Jan 4, 2026 · Updated 6 months ago
HYUNJS / DecAF
[ICLR 2026] Official implementation of "Decomposed Attention Fusion in MLLMs for Training-Free Video Reasoning Segmentation"
☆36 · Jan 26, 2026 · Updated 6 months ago
Run542968 / Awesome-3D-Human-Motion-Generation
☆25 · Jul 24, 2024 · Updated 2 years ago
kumuji / Sa2VA-i
Sa2VA-i is an improved version of the popular Sa2VA model
☆17 · Nov 25, 2025 · Updated 8 months ago
cvlab-kaist / InterRVOS
Official implementation of "InterRVOS: Interaction-aware Referring Video Object Segmentation".
☆32 · May 1, 2026 · Updated 2 months ago
nikosips / UDON
☆11 · Nov 18, 2024 · Updated last year
RAIVNLab / VideoNet
CVPR '26 Highlight
☆24 · May 6, 2026 · Updated 2 months ago
sosppxo / MDIN
[MM2024 Oral] 3D-GRES: Generalized 3D Referring Expression Segmentation
☆43 · Dec 15, 2024 · Updated last year
bytepioneerX / s3mot
☆33 · Mar 10, 2026 · Updated 4 months ago
FudanCVL / SAM-MT
[ECCV 2026] Real-Time Interactive Multi-Target Video Segmentation
☆54 · Jul 10, 2026 · Updated 2 weeks ago
jbistanbul / universalvtg
Official Code for the paper "UniversalVTG: A Universal and Lightweight Foundation Model for Video Temporal Grounding"
☆15 · Apr 15, 2026 · Updated 3 months ago