iSEE-Laboratory / ReferDINO
The official implementation of the paper "ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations".
☆38Updated 3 months ago
Alternatives and similar repositories for ReferDINO:
Users that are interested in ReferDINO are comparing it to the libraries listed below
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆57Updated 3 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]☆46Updated last month
- Official implementation of "Seurat: From Moving Points to Depth", CVPR 2025 Highlight☆22Updated 2 weeks ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere☆113Updated 9 months ago
- ☆47Updated 10 months ago
- Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆62Updated last month
- ☆56Updated 3 weeks ago
- Official Code For Track Everything Everywhere Fast and Robustly☆60Updated last month
- Official implementation of "Exploring Temporally-Aware Features for Point Tracking" (CVPR 2025)☆73Updated 3 weeks ago
- The official implementation of "CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities". (arXiv 2501.08983)☆89Updated 3 months ago
- [CVPR25] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆164Updated 2 weeks ago
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024.☆50Updated 5 months ago
- ☆23Updated last month
- Official implementation of "URECA : Unique Region Caption Anything"☆43Updated 2 weeks ago
- [CVPR 2025] Official code for Using Diffusion Priors for Video Amodal Segmentation☆71Updated last week
- ☆29Updated last year
- Official pytorch implementation of "XHand: Real-time Expressive Hand Avatar"☆77Updated 8 months ago
- [ArXiv 2025] DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting☆23Updated last week
- Official Code for "MITracker: Multi-View Integration for Visual Object Tracking"☆60Updated last month
- CAVIS: Context-Aware Video Instance Segmentation☆86Updated last week
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)☆166Updated last week
- ☆78Updated 3 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆50Updated 3 months ago
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing☆34Updated 2 months ago
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"☆125Updated last month
- ☆34Updated last year
- An unofficial implementation of DreamScene360.☆80Updated 10 months ago
- Scaling Properties of Diffusion Models For Perceptual Tasks☆38Updated 5 months ago
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)☆69Updated 2 weeks ago
- ☆22Updated 3 weeks ago