(ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
☆136Nov 14, 2025Updated 5 months ago
Alternatives and similar repositories for ReferDINO
Users that are interested in ReferDINO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆33Feb 28, 2026Updated last month
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆47Sep 28, 2024Updated last year
- (NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"☆35Mar 22, 2025Updated last year
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆69Jun 23, 2025Updated 9 months ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆21Jul 20, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark☆30Nov 16, 2025Updated 5 months ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆180Oct 15, 2025Updated 6 months ago
- [ICCV 2025] MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation☆22Sep 5, 2025Updated 7 months ago
- Official code of DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction (3DV 2025))☆173Jan 29, 2025Updated last year
- (CVPR 2026) Official repository of paper "WeDetect: Fast Open-Vocabulary Object Detection as Retrieval"☆168Feb 21, 2026Updated last month
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆373Sep 25, 2025Updated 6 months ago
- Video Reasoning Segmentation☆27Nov 29, 2024Updated last year
- 复旦研究生入学教育测试☆24Aug 28, 2025Updated 7 months ago
- ☆38Sep 29, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆43Nov 21, 2025Updated 4 months ago
- Large-Vocabulary Video Instance Segmentation dataset☆97Jul 5, 2024Updated last year
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆32Apr 8, 2025Updated last year
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆81Oct 22, 2025Updated 5 months ago
- (ECCV 2024) Official repository of paper "Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection"☆21Mar 26, 2025Updated last year
- 🔥 Latest advances in Video Object Segmentation (VOS) – papers, datasets, and projects.☆485Updated this week
- [ICCV 2025] MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network☆31Dec 16, 2025Updated 4 months ago
- ☆26Oct 15, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Pruned CoTracker architecture for tracking the myocardium in 2D echo images.☆19May 6, 2025Updated 11 months ago
- [NeurIPS 2025] PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer☆30Oct 2, 2025Updated 6 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆102Oct 29, 2025Updated 5 months ago
- [CVPR 2025] Official implementation of the paper "SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction"☆46Dec 11, 2025Updated 4 months ago
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"☆47Jul 1, 2025Updated 9 months ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆88Sep 8, 2025Updated 7 months ago
- ☆44Feb 5, 2025Updated last year
- A list of video object segmentation (VOS) papers☆308Oct 22, 2025Updated 5 months ago
- ☆137Jul 4, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- High Quality Video Reasoning Segmentation☆149Nov 24, 2025Updated 4 months ago
- A list of video inpainting (VI) papers☆28Dec 14, 2024Updated last year
- ☆37Sep 25, 2025Updated 6 months ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆209Aug 5, 2024Updated last year
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆23Jun 27, 2025Updated 9 months ago
- Multimodal Referring Segmentation☆233Jan 22, 2026Updated 2 months ago
- ☆46Apr 26, 2024Updated last year