heshuting555 / D2ZeroLinks
[CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
☆18Updated 2 years ago
Alternatives and similar repositories for D2Zero
Users that are interested in D2Zero are comparing it to the libraries listed below
Sorting:
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆86Updated last year
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆66Updated last year
- Multimodal Referring Segmentation☆201Updated last month
- A benchmark dataset for GREx: GRES, GREC, and GREG [CVPR 2023 & IJCV 2026]☆239Updated 2 months ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆87Updated 4 months ago
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆189Updated 2 years ago
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆358Updated 4 years ago
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆361Updated 4 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆48Updated last year
- OVAD: Open-vocabulary Attribute Detection code☆31Updated 2 years ago
- [ICCV 2023] PyTorch implementation of RandBox☆56Updated 2 years ago
- ☆32Updated last year
- Large-Vocabulary Video Instance Segmentation dataset☆96Updated last year
- [ICCV 2025] AnyI2V: Animating Any Conditional Image with Motion Control Generation☆119Updated 5 months ago
- [IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation☆28Updated last year
- ☆13Updated last year
- [NeurIPS 2025] Composed Person Retrieval (CPR) is a new cross-modal retrieval task that aims to identify individuals in large-scale perso…☆72Updated 3 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"☆269Updated 2 weeks ago
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆82Updated 2 years ago
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆80Updated 2 years ago
- Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)☆12Updated 8 months ago
- ☆37Updated last year
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆52Updated 3 months ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆49Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆18Updated 9 months ago
- [AAAI 2026] Segment Anything Across Shots: A Method and Benchmark☆26Updated 2 months ago
- Disentangled Pre-training for Human-Object Interaction Detection☆27Updated 4 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆80Updated last year
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆92Updated last year