Code release for "Language-conditioned Detection Transformer"
☆88 · Jun 17, 2024 · Updated last year
Alternatives and similar repositories for DECOLA
Users who are interested in DECOLA are comparing it to the repositories listed below
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection ☆123 · Apr 26, 2024 · Updated last year
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model ☆99 · Jul 15, 2024 · Updated last year
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation ☆33 · Mar 16, 2024 · Updated last year
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection ☆184 · Oct 25, 2023 · Updated 2 years ago
- EfficientSAM + YOLO World base model for use with Autodistill. ☆10 · Feb 21, 2024 · Updated 2 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023] ☆52 · Oct 26, 2025 · Updated 4 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆201 · Feb 5, 2024 · Updated 2 years ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation". ☆27 · Oct 13, 2024 · Updated last year
- Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection" ☆95 · Jun 22, 2023 · Updated 2 years ago
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection". ☆44 · Sep 12, 2024 · Updated last year
- [NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation ☆14 · Oct 7, 2023 · Updated 2 years ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning ☆80 · Oct 25, 2024 · Updated last year
- [ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction" ☆91 · Dec 23, 2025 · Updated 2 months ago
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual… ☆82 · Feb 22, 2025 · Updated last year
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E… ☆343 · Nov 6, 2025 · Updated 4 months ago
- An unofficial implementation for paper "DenseCLIP: Extract Free Dense Labels from CLIP" ☆23 · Jan 27, 2022 · Updated 4 years ago
- [CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners ☆44 · Jun 14, 2023 · Updated 2 years ago
- RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers" ☆17 · Aug 24, 2023 · Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings" ☆131 · Aug 21, 2024 · Updated last year
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection ☆99 · Sep 12, 2023 · Updated 2 years ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking ☆29 · Sep 12, 2024 · Updated last year
- Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition" ☆259 · May 3, 2024 · Updated last year
- Official implementation of "Referring Video Object Segmentation via Language Aligned Track Selection". ☆40 · Jun 2, 2025 · Updated 9 months ago
- Open-vocabulary Semantic Segmentation ☆33 · Feb 16, 2024 · Updated 2 years ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model ☆19 · Jul 20, 2024 · Updated last year
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning" ☆529 · Apr 8, 2024 · Updated last year
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆37 · Nov 27, 2024 · Updated last year
- (TPAMI 2024) A Survey on Open Vocabulary Learning ☆994 · Dec 24, 2025 · Updated 2 months ago
- Taming Self-Training for Open-Vocabulary Object Detection, CVPR 2024 ☆21 · Dec 30, 2023 · Updated 2 years ago
- ☆27 · Nov 13, 2024 · Updated last year
- Code for "Long-tail Detection with Effective Class-Margins." (ECCV 2022 Oral) ☆63 · Sep 2, 2023 · Updated 2 years ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation ☆37 · Oct 18, 2023 · Updated 2 years ago
- [CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection ☆98 · Jul 24, 2024 · Updated last year
- ☆37 · Mar 22, 2024 · Updated last year
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation ☆13 · Jun 17, 2024 · Updated last year
- ☆10 · Apr 7, 2025 · Updated 11 months ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of … ☆505 · Aug 9, 2024 · Updated last year
- [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale ☆1,170 · Oct 21, 2024 · Updated last year
- ☆120 · Feb 19, 2024 · Updated 2 years ago