heshuting555 / D2ZeroLinks
[CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
☆18Updated 2 years ago
Alternatives and similar repositories for D2Zero
Users that are interested in D2Zero are comparing it to the libraries listed below
Sorting:
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆86Updated last year
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆66Updated last year
- A benchmark dataset for GREx: GRES, GREC, and GREG [CVPR 2023 & IJCV 2026]☆239Updated 2 months ago
- Multimodal Referring Segmentation☆208Updated 2 weeks ago
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆189Updated 2 years ago
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆363Updated 4 months ago
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆358Updated 4 years ago
- OVAD: Open-vocabulary Attribute Detection code☆31Updated 2 years ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆87Updated 5 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆48Updated last year
- [ICCV 2023] PyTorch implementation of RandBox☆56Updated 2 years ago
- [NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale perso…☆72Updated 3 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆49Updated last year
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆32Updated 8 months ago
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)☆18Updated 10 months ago
- ☆32Updated last year
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆80Updated 2 years ago
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆82Updated 2 years ago
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Updated 10 months ago
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆57Updated 2 years ago
- Large-Vocabulary Video Instance Segmentation dataset☆96Updated last year
- [IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation☆28Updated last year
- [ICCV 2025] AnyI2V: Animating Any Conditional Image with Motion Control Generation☆119Updated 5 months ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Updated 2 years ago
- ☆37Updated last year
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆47Updated last year
- Segment Anything with Deictic Prompting☆27Updated 8 months ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆12Updated 2 years ago
- [IEEE TCSVT] Official Pytorch Implementation of CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation.☆47Updated last year