xiaomoguhz / OV-DQUO
[AAAI2025] Code Release of OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
☆25 Updated 7 months ago
Alternatives and similar repositories for OV-DQUO
Users who are interested in OV-DQUO are comparing it to the libraries listed below
- NTIRE 2025 Challenge on 1st Cross-Domain Few-Shot Object Detection @ CVPR 2025 ☆45 Updated 2 months ago
- ☆77 Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding. ☆51 Updated 3 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models" ☆79 Updated 4 months ago
- ☆20 Updated 11 months ago
- ☆27 Updated last year
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction" ☆80 Updated 3 months ago
- This repo is the official PyTorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V… ☆33 Updated 7 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP) ☆92 Updated 2 months ago
- Official implementation of ICML 2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation ☆51 Updated 11 months ago
- [ECCV 2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes ☆85 Updated 2 months ago
- [CVPR 2024] Official PyTorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation. ☆176 Updated last year
- Awesome video instance segmentation papers ☆43 Updated last week
- ☆22 Updated last month
- Official Code for 'Referring Camouflaged Object Detection (指向性伪装物体检测)' (TPAMI 2025) ☆102 Updated 6 months ago
- (ECCV 2024) VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation ☆88 Updated last month
- ☆12 Updated last year
- CVPR2024 ☆85 Updated 4 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts" ☆82 Updated last month
- ☆29 Updated 5 months ago
- [ECCV 2024] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection ☆26 Updated 9 months ago
- This repository contains the implementation for the paper "Revisiting Few Shot Object Detection with Vision-Language Models" ☆74 Updated last month
- [ECCV 2024] The Official Implementation for "AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection" ☆235 Updated last week
- ☆58 Updated 11 months ago
- ☆25 Updated last year
- [ICLR 2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆190 Updated last year
- (CVPR 2023/TPAMI 2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆195 Updated 11 months ago
- [CVPR 2024] Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification ☆107 Updated 8 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling. ☆21 Updated 4 months ago
- Official implementation of the paper 'GlocalCLIP: Object-agnostic Global-Local Prompt Learning for Zero-shot Anomaly Detection' ☆25 Updated 4 months ago