heshuting555 / D2ZeroLinks
[CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
☆18Updated 2 years ago
Alternatives and similar repositories for D2Zero
Users that are interested in D2Zero are comparing it to the libraries listed below
Sorting:
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆85Updated last year
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆65Updated last year
- Multimodal Referring Segmentation☆181Updated 2 months ago
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆238Updated last week
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆190Updated 2 years ago
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆359Updated 3 years ago
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆361Updated 2 months ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆83Updated 2 months ago
- [ICCV 2023] PyTorch implementation of RandBox☆56Updated 2 years ago
- ☆37Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆48Updated last year
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆48Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated 2 years ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆55Updated 2 years ago
- Code for the VOST dataset☆26Updated 2 years ago
- OVAD: Open-vocabulary Attribute Detection code☆31Updated 2 years ago
- [IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation☆27Updated last year
- Large-Vocabulary Video Instance Segmentation dataset☆95Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆18Updated 7 months ago
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆45Updated last year
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆29Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated last year
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆79Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆95Updated 7 months ago
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Updated last year
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Updated last year
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)☆18Updated 7 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆45Updated last year