xiaomoguhz / OV-DQUOLinks
[AAAI2025] Code Release of OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
☆20Updated 5 months ago
Alternatives and similar repositories for OV-DQUO
Users that are interested in OV-DQUO are comparing it to the libraries listed below
Sorting:
- This repo is the official pytorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V…☆31Updated 5 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆49Updated 9 months ago
- NTIRE 2025 Challenge on 1-st Cross-Domain Few-Shot Object Detection @ CVPR 2025☆41Updated last month
- Towards Training-free Open-world Segmentation via Image Prompt Foundation Models,☆10Updated 6 months ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆50Updated last month
- Official implementation of ResCLIP: Residual Attention for Training-free Dense Vision-language Inference☆37Updated 2 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆73Updated 11 months ago
- [ECCV' 24] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection☆24Updated 8 months ago
- Awesome video instance segmentation papers☆40Updated 2 weeks ago
- ☆21Updated 9 months ago
- ☆58Updated 9 months ago
- Distribution Prototype Diffusion Learning for Open-set Supervised Anomaly Detection CVPR 2025☆13Updated 3 months ago
- [ECCV 2024] Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation☆31Updated 3 months ago
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆38Updated 2 months ago
- ☆19Updated 7 months ago
- ☆26Updated last year
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆37Updated 10 months ago
- cliptrase☆36Updated 9 months ago
- ☆75Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆188Updated last year
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆20Updated 3 months ago
- [AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"☆26Updated 5 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆77Updated 5 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆29Updated last year
- ☆41Updated 3 months ago
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation☆23Updated 7 months ago
- Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)☆25Updated last year
- Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment [CVPR-2024]☆20Updated 11 months ago
- [CVPR 2025] Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation☆14Updated 2 weeks ago
- Python code to implement DeIL, a CLIP based approach for open-world few-shot learning.☆15Updated 7 months ago