YifanXu74 / MQ-Det
Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)
☆280Updated 10 months ago
Alternatives and similar repositories for MQ-Det:
Users that are interested in MQ-Det are comparing it to the libraries listed below
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection☆152Updated 9 months ago
- A curated list of papers, datasets and resources pertaining to open vocabulary object detection.☆296Updated 6 months ago
- Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆283Updated this week
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023☆180Updated last year
- [CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception☆502Updated 8 months ago
- Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".☆554Updated last year
- A new framework for open-vocabulary object detection, based on maskrcnn-benchmark☆233Updated last year
- ☆175Updated 2 years ago
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆237Updated last month
- PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022☆163Updated 2 years ago
- Open-vocabulary Semantic Segmentation☆321Updated 3 months ago
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning"☆427Updated 9 months ago
- [NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary …☆285Updated 2 years ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆177Updated last year
- CoRL 2024☆363Updated 2 months ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆214Updated 2 years ago
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…☆296Updated 11 months ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆184Updated 9 months ago
- [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"☆679Updated 11 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆180Updated 11 months ago
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆114Updated 6 months ago
- [CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"☆732Updated 9 months ago
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet☆212Updated 2 years ago
- Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models☆191Updated last week
- GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)☆310Updated last year
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆116Updated 8 months ago
- Recognize Any Regions☆122Updated 3 weeks ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆472Updated 5 months ago
- This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detectio…☆502Updated 6 months ago
- [ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"☆220Updated 8 months ago