heshuting555 / D2Zero
[CVPR-2023] Semantic-Promoted Debiasing and Background Disambiguation for Zero-Shot Instance Segmentation
☆17Updated last year
Related projects ⓘ
Alternatives and complementary repositories for D2Zero
- [CVPR-2024] Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation☆76Updated 3 months ago
- [TIP-2023] Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation☆57Updated last year
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆24Updated 9 months ago
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆182Updated last year
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆26Updated 4 months ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated last year
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation☆46Updated 3 months ago
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆176Updated last year
- [ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation☆52Updated last year
- [ICML2024]The official implementation of SemiRES in PyTorch.☆19Updated 4 months ago
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning☆30Updated 11 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆45Updated 3 months ago
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation☆20Updated 10 months ago
- The offical implemention of JM3D.☆27Updated last year
- Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)☆32Updated last year
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆33Updated last year
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆42Updated 2 years ago
- VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation☆18Updated last month
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆27Updated 6 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆62Updated 2 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆35Updated last year
- ☆29Updated 7 months ago
- Robust Referring Video Object Segmentation with Cyclic Structural Consistency [ICCV 2023]☆25Updated 7 months ago
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆10Updated 6 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆40Updated 3 months ago
- OVAD: Open-vocabulary Attribute Detection code☆28Updated last year
- (TIP 2024) Towards Robust Referring Image Segmentation☆22Updated 8 months ago
- Official repository of paper: "FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation"☆24Updated last year
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆29Updated 5 months ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Updated 9 months ago