iSEE-Laboratory / ReferDINO
The official implementation of the paper "ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations".
☆40 Updated 5 months ago
Alternatives and similar repositories for ReferDINO
Users who are interested in ReferDINO are comparing it to the libraries listed below.
- Official implementation of "Seurat: From Moving Points to Depth", CVPR 2025 Highlight ☆56 Updated 2 months ago
- Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025] ☆49 Updated 3 months ago
- Official code for "JAFAR: Jack up Any Feature at Any Resolution" ☆124 Updated last week
- Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation ☆67 Updated 3 weeks ago
- [CVPR 2025] Official code for Using Diffusion Priors for Video Amodal Segmentation ☆77 Updated 2 weeks ago
- ☆41 Updated 4 months ago
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆113 Updated 11 months ago
- Official PyTorch implementation of Self-Supervised Any-Point Tracking by Contrastive Random Walks, ECCV 2024. ☆52 Updated 7 months ago
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight) ☆54 Updated last week
- ☆47 Updated last year
- Official implementation of "Exploring Temporally-Aware Features for Point Tracking" (CVPR 2025) ☆87 Updated 2 months ago
- Official Code For Track Everything Everywhere Fast and Robustly ☆66 Updated 3 months ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024) ☆180 Updated 2 months ago
- ☆79 Updated 5 months ago
- open-sourced video dataset with dynamic scenes and camera movements annotation ☆61 Updated 2 months ago
- CAVIS: Context-Aware Video Instance Segmentation ☆86 Updated 2 months ago
- [ICLR 2025] Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels" ☆40 Updated 2 months ago
- Official pytorch implementation of "XHand: Real-time Expressive Hand Avatar" ☆78 Updated 10 months ago
- [ArXiv 2025] DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting ☆27 Updated last month
- StableRecon: Making Video to 3D easy ☆76 Updated 8 months ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model ☆66 Updated 5 months ago
- ☆34 Updated last year
- PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage ☆45 Updated 3 weeks ago
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6 ☆10 Updated 8 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025) ☆210 Updated 2 months ago
- ☆23 Updated 2 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆109 Updated 3 months ago
- Official repository for the paper "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation." ☆85 Updated 3 weeks ago
- ☆32 Updated last month
- ☆72 Updated 2 months ago