LutingWang / OADP
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection
☆54Updated last week
Related projects: ⓘ
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆172Updated 10 months ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆128Updated 3 months ago
- PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"☆78Updated last year
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022☆86Updated 8 months ago
- Code Implementation of "Unsupervised Recognition of Unknown Objects for Open-World Object Detection"☆25Updated 11 months ago
- ☆59Updated last year
- ☆33Updated this week
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆27Updated last month
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"☆62Updated 4 months ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆56Updated last year
- [ICCV 2023] Code for "Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement"☆135Updated 4 months ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆52Updated 6 months ago
- ☆34Updated 2 years ago
- ☆32Updated last year
- A lightweight codebase for referring expression comprehension and segmentation☆50Updated 2 years ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆104Updated 2 months ago
- ☆171Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆161Updated 7 months ago
- [ICCV 2023] PyTorch implementation of RandBox☆51Updated 10 months ago
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"☆77Updated 2 months ago
- ☆85Updated 11 months ago
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆98Updated last year
- ☆32Updated 5 months ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆88Updated last year
- Code for the paper: "SuS-X: Training-Free Name-Only Transfer of Vision-Language Models" [ICCV'23]☆91Updated last year
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆27Updated last week
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆25Updated 7 months ago
- Referring Video Object Segmentation / Multi-Object Tracking Repo☆84Updated last year
- Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"☆76Updated last year
- [ICCV' 23 ORAL] Novel Scenes & Classes: Towards Adaptive Open-set Object Detection☆34Updated 6 months ago