prannaykaul/mm-ovod

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/prannaykaul/mm-ovod)

prannaykaul / mm-ovod

Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"

☆95

Alternatives and similar repositories for mm-ovod

Users that are interested in mm-ovod are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CVMI-Lab / CoDet
View on GitHub
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
☆123Apr 26, 2024Updated 2 years ago
bytedance / OmniScient-Model
View on GitHub
This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model
☆102Jul 15, 2024Updated 2 years ago
LutingWang / OADP
View on GitHub
Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection
☆64Jan 6, 2026Updated 6 months ago
wusize / ovdet
View on GitHub
[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection
☆187Oct 25, 2023Updated 2 years ago
clin1223 / VLDet
View on GitHub
[ICLR 2023] PyTorch implementation of VLDet （https://arxiv.org/abs/2211.14843）
☆191Mar 22, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Charles-Xie / awesome-described-object-detection
View on GitHub
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…
☆358Nov 6, 2025Updated 8 months ago
tgxs002 / CORA
View on GitHub
A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023
☆202Apr 16, 2023Updated 3 years ago
yuhangzang / OV-DETR
View on GitHub
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
☆240Aug 3, 2022Updated 3 years ago
ZrrSkywalker / CaFo
View on GitHub
[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
☆45Jun 14, 2023Updated 3 years ago
nukezil / IOMatch
View on GitHub
[ICCV 2023 Oral] IOMatch: Simplifying Open-Set Semi-Supervised Learning with Joint Inliers and Outliers Utilization
☆57Jan 28, 2024Updated 2 years ago
RAIVNLab / MIMIC
View on GitHub
MIMIC: Masked Image Modeling with Image Correspondences
☆16Jun 14, 2024Updated 2 years ago
lorebianchi98 / FG-OVD
View on GitHub
[CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…
☆68Apr 4, 2025Updated last year
IDEA-Research / OpenSeeD
View on GitHub
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
☆763Jan 22, 2024Updated 2 years ago
janghyuncho / DECOLA
View on GitHub
Code release for "Language-conditioned Detection Transformer"
☆86Jun 17, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Surrey-UP-Lab / RegionSpot
View on GitHub
Recognize Any Regions
☆123Dec 18, 2024Updated last year
SooLab / REP-ERU
View on GitHub
[ECCV2022] A PyTorch implementation of the paper "Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embo…
☆13Mar 20, 2023Updated 3 years ago
witnessai / Awesome-Open-Vocabulary-Object-Detection
View on GitHub
A curated list of papers, datasets and resources pertaining to open vocabulary object detection.
☆422May 13, 2025Updated last year
xuanlinli17 / large_vlm_distillation_ood
View on GitHub
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)
☆61Apr 8, 2024Updated 2 years ago
sail-sg / ScaleLong
View on GitHub
The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…
☆50Oct 23, 2023Updated 2 years ago
jyFengGoGo / InstructDet
View on GitHub
☆37Mar 22, 2024Updated 2 years ago
Dawn-LX / OpenVoc-VidVRD
View on GitHub
Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
☆43Jun 4, 2024Updated 2 years ago
Zehong-Ma / OVMR
View on GitHub
OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)
☆36Jun 16, 2025Updated last year
MendelXu / SAN
View on GitHub
Open-vocabulary Semantic Segmentation
☆384Oct 16, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
orrzohar / PROB
View on GitHub
[CVPR 2023] Official Pytorch code for PROB: Probabilistic Objectness for Open World Object Detection
☆151Oct 29, 2024Updated last year
amazon-science / prompt-pretraining
View on GitHub
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
☆259May 3, 2024Updated 2 years ago
wusize / CLIPSelf
View on GitHub
[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
☆207Feb 5, 2024Updated 2 years ago
OpenGVLab / all-seeing
View on GitHub
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …
☆507Aug 9, 2024Updated last year
epic-kitchens / epic-kitchens-100-object-masks
View on GitHub
Support library for the MaskRCNN masks extracted on EPIC-KITCHENS-100
☆14Dec 1, 2020Updated 5 years ago
dyabel / detpro
View on GitHub
☆188Nov 7, 2022Updated 3 years ago
xiaofeng94 / SAS-Det
View on GitHub
Taming Self-Training for Open-Vocabulary Object Detection, CVPR 2024
☆22Dec 30, 2023Updated 2 years ago
facebookresearch / Detic
View on GitHub
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
☆2,008Mar 21, 2024Updated 2 years ago
frh23333 / mepu-owod
View on GitHub
Code Implementation of "Unsupervised Recognition of Unknown Objects for Open-World Object Detection"
☆34Oct 13, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
haochenheheda / LVVIS
View on GitHub
Large-Vocabulary Video Instance Segmentation dataset
☆100Jul 5, 2024Updated 2 years ago
ekazakos / grove
View on GitHub
Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)
☆31Jan 18, 2026Updated 6 months ago
CityU-AIM-Group / SOMA
View on GitHub
[ICCV' 23 Oral] Novel Scenes & Classes: Towards Adaptive Open-set Object Detection
☆49May 23, 2025Updated last year
fcjian / PromptDet
View on GitHub
PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022
☆173Sep 18, 2022Updated 3 years ago
HunterJ-Lin / WSOVOD
View on GitHub
Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024
☆36Sep 9, 2024Updated last year
ArrowLuo / SegCLIP
View on GitHub
PyTorch implementation of ICML 2023 paper "SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation"
☆98Jun 28, 2023Updated 3 years ago
shikras / d-cube
View on GitHub
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating…
☆138Mar 20, 2024Updated 2 years ago