A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
☆64Apr 10, 2026Updated this week
Alternatives and similar repositories for OVDEval
Users that are interested in OVDEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration☆80Nov 20, 2025Updated 4 months ago
- ☆22Jun 30, 2023Updated 2 years ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022☆94Jan 16, 2024Updated 2 years ago
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…☆67Apr 4, 2025Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆38Sep 12, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Papers of "A Survey on Multimodal LLMs from the Perspective of Input-Output Space Extension"☆17Feb 4, 2026Updated 2 months ago
- A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating…☆137Mar 20, 2024Updated 2 years ago
- [Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)☆237Aug 3, 2022Updated 3 years ago
- GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)☆74Updated this week
- Code for paper 'Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction' (TOMM 2023)☆10Sep 6, 2025Updated 7 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆84Jan 2, 2026Updated 3 months ago
- GEOSatDB is a semantic representation of Earth observation satellites and sensors that can be used to easily discover available Earth obs…☆15Aug 6, 2024Updated last year
- OVAD: Open-vocabulary Attribute Detection code☆31Aug 28, 2023Updated 2 years ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆202Feb 5, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆34Aug 4, 2023Updated 2 years ago
- This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in th…☆70Jul 22, 2022Updated 3 years ago
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection☆194Mar 29, 2025Updated last year
- [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Obj…☆77Jul 29, 2025Updated 8 months ago
- MMPD Dataset from ECCV'2024 "When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset"☆21Jul 15, 2024Updated last year
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆191Mar 22, 2024Updated 2 years ago
- Make Large Multimodal Models excel in object detection, ICCV 2025☆64Aug 1, 2025Updated 8 months ago
- ☆20Jan 7, 2024Updated 2 years ago
- Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)☆29Jan 12, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A curated list of papers, datasets and resources pertaining to open vocabulary object detection.☆414May 13, 2025Updated 11 months ago
- Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision☆43Oct 19, 2025Updated 5 months ago
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆123Apr 26, 2024Updated last year
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆28Nov 8, 2023Updated 2 years ago
- ☆120Jun 11, 2024Updated last year
- ☆32Mar 7, 2022Updated 4 years ago
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆45Sep 12, 2024Updated last year
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆185Oct 25, 2023Updated 2 years ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆33Jun 3, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆348Nov 6, 2025Updated 5 months ago
- Supplementary Material for Non-binary Deep Transfer Learning for Image Classification☆18Jul 22, 2021Updated 4 years ago
- Code use to create COCO Attributes dataset and experiments in the associate ECCV 2016 paper.☆49Dec 26, 2022Updated 3 years ago
- Auto Segmentation label generation with SAM (Segment Anything) + Grounding DINO☆22Feb 11, 2025Updated last year
- Official pyTorch implementation of Transformer-based PAUP model for sequential recommentation, SIGIR 2022☆10Sep 8, 2022Updated 3 years ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"☆17Aug 24, 2023Updated 2 years ago
- Localized Vision-Language Matching for Open-vocabulary Object Detection☆22Aug 11, 2022Updated 3 years ago