heshuting555 / D2ZeroLinks
[CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
☆18Updated 2 years ago
Alternatives and similar repositories for D2Zero
Users that are interested in D2Zero are comparing it to the libraries listed below
Sorting:
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆85Updated last year
- Multimodal Referring Segmentation☆118Updated last week
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆65Updated last year
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆238Updated last year
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆356Updated last year
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆359Updated 3 years ago
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆190Updated 2 years ago
- ☆32Updated last year
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆47Updated last year
- OVAD: Open-vocabulary Attribute Detection code☆31Updated 2 years ago
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆80Updated 2 years ago
- [ICCV 2025] AnyI2V: Animating Any Conditional Image with Motion Control Generation☆109Updated last week
- [ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions☆521Updated 3 weeks ago
- [ICCV 2023] PyTorch implementation of RandBox☆56Updated last year
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆42Updated last year
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆38Updated last year
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆67Updated 10 months ago
- [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation☆691Updated last year
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆32Updated 3 months ago
- 【CVPRW'23】First Place Solution to the CVPR'2023 AQTC Challenge☆15Updated 2 years ago
- [ICCV 2025] MOVE: Motion-Guided Few-Shot Video Object Segmentation☆20Updated last month
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆40Updated last year
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆29Updated last year
- ☆25Updated 2 years ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆22Updated 6 months ago
- Disentangled Pre-training for Human-Object Interaction Detection☆25Updated 2 months ago
- [IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation☆27Updated 11 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆30Updated last year
- Test-Time Training on Video Streams☆64Updated 2 years ago
- ☆18Updated last year