☆37Mar 22, 2024Updated 2 years ago
Alternatives and similar repositories for InstructDet
Users that are interested in InstructDet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Jan 6, 2026Updated 5 months ago
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆356Nov 6, 2025Updated 7 months ago
- Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023☆12Oct 3, 2023Updated 2 years ago
- A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating…☆137Mar 20, 2024Updated 2 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast…☆30May 28, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official code for "Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models" (TCSVT'2023)☆28Dec 27, 2023Updated 2 years ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated 2 years ago
- (TIP 2024) Towards Robust Referring Image Segmentation☆39Mar 2, 2024Updated 2 years ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Oct 18, 2023Updated 2 years ago
- An unofficial implementation for paper "DenseCLIP: Extract Free Dense Labels from CLIP"☆24Jan 27, 2022Updated 4 years ago
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆36Jun 16, 2025Updated last year
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆30Nov 28, 2024Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 4 months ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆509Aug 9, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SOIT: Segmenting Objects with Instance-Aware Transformers☆14Jun 6, 2022Updated 4 years ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆186Oct 25, 2023Updated 2 years ago
- Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"☆95Jun 22, 2023Updated 3 years ago
- Open-vocabulary Semantic Segmentation☆185Mar 28, 2023Updated 3 years ago
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆30Jul 23, 2024Updated last year
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation☆48Jul 18, 2024Updated last year
- ☆61May 2, 2025Updated last year
- ☆12May 6, 2022Updated 4 years ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆71Apr 7, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆123Apr 26, 2024Updated 2 years ago
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- Code release for "Language-conditioned Detection Transformer"☆86Jun 17, 2024Updated 2 years ago
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM)☆38Apr 7, 2023Updated 3 years ago
- A Holistic Embodied Cognition Benchmark☆19Apr 3, 2025Updated last year
- [ICCV 2023 Workshop] The Official Implementation of The First Prize Solution for RVOS Competition☆14Jan 1, 2024Updated 2 years ago
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated 2 years ago
- [ICCV 2019] Monocular depth estimation from a single image☆12May 20, 2022Updated 4 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆192Mar 22, 2024Updated 2 years ago
- Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning☆112May 28, 2025Updated last year
- Related papers about Referring Image Segmentation (RIS)☆16Dec 26, 2023Updated 2 years ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆70Mar 14, 2024Updated 2 years ago
- Official repo for "DynaMITe: Dynamic Query Bootstrapping for Multi-object Interactive Segmentation Transformer"☆19Sep 29, 2023Updated 2 years ago
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- Harvard Fall 2019 Applied Math 207 A Primer and Critique of Prior Networks☆12Dec 22, 2019Updated 6 years ago