shikras/d-cube

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

☆138

Alternatives and similar repositories for d-cube

Users who are interested in d-cube are comparing it to the repositories listed below.

Charles-Xie / awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…
☆358 · Nov 6, 2025 · Updated 8 months ago
Charles-Xie / CQL
Code for our paper "Category Query Learning for Human-Object Interaction Classification" (CVPR2023)
☆37 · Jul 9, 2023 · Updated 3 years ago
OpenGVLab / all-seeing
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …
☆508 · Aug 9, 2024 · Updated last year
CVMI-Lab / CoDet
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
☆123 · Apr 26, 2024 · Updated 2 years ago
V3Det / V3Det
☆121 · Jun 11, 2024 · Updated 2 years ago
jshilong / GPT4RoI
(ECCVW 2025) GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
☆556 · Jun 3, 2025 · Updated last year
jyFengGoGo / InstructDet
☆37 · Mar 22, 2024 · Updated 2 years ago
microsoft / FIBER
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
☆131 · Oct 10, 2023 · Updated 2 years ago
clin1223 / VLDet
[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)
☆191 · Mar 22, 2024 · Updated 2 years ago
FoundationVision / GenerateU
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
☆196 · Mar 29, 2025 · Updated last year
tsb0601 / MMVP
☆364 · Jan 27, 2024 · Updated 2 years ago
berkeley-hipie / HIPIE
[NeurIPS2023] Code release for "Hierarchical Open-vocabulary Universal Image Segmentation"
☆294 · Jun 19, 2025 · Updated last year
wusize / ovdet
[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection
☆187 · Oct 25, 2023 · Updated 2 years ago
YiqunChen1999 / RefineBox
Implementation of Enhancing Your Trained DETRs with Box Refinement
☆60 · Jul 26, 2023 · Updated 2 years ago
samschulter / omnilabeltools
A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization
☆23 · Feb 1, 2025 · Updated last year
zhjohnchan / bert-clip-synesthesia
[Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.
☆14 · Jun 7, 2023 · Updated 3 years ago
impiga / Plain-DETR
[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design
☆232 · Nov 14, 2023 · Updated 2 years ago
jinga-lala / DAMEX
Code for "DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets", accepted at Neurips 2023 (Main confer…
☆28 · Mar 29, 2024 · Updated 2 years ago
jianzongwu / betrayed-by-captions
(ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation
☆48 · Jul 18, 2024 · Updated 2 years ago
JacobYuan7 / RLIPv2
[ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training
☆136 · May 28, 2024 · Updated 2 years ago
IDEA-Research / OpenSeeD
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
☆762 · Jan 22, 2024 · Updated 2 years ago
shenyunhang / APE
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
☆609 · May 8, 2024 · Updated 2 years ago
CongHan0808 / DeOP
Open-vocabulary Semantic Segmentation
☆33 · Feb 16, 2024 · Updated 2 years ago
kingthreestones / RefCLIP
☆39 · Jun 28, 2023 · Updated 3 years ago
om-ai-lab / OVDEval
A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)
☆63 · Apr 10, 2026 · Updated 3 months ago
jshilong / DDQ
(CVPR2023) Dense Distinct Query for End-to-End Object Detection
☆266 · May 24, 2023 · Updated 3 years ago
mightyzau / RegionBLIP
☆59 · Aug 7, 2023 · Updated 2 years ago
deepglint / ALIP
[ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption
☆106 · Sep 18, 2023 · Updated 2 years ago
seanzhuh / SeqTR
SeqTR: A Simple yet Universal Network for Visual Grounding
☆144 · Oct 30, 2024 · Updated last year
Show-han / Zeroshot_REC
Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)
☆28 · Jun 21, 2024 · Updated 2 years ago
MarkMoHR / Awesome-Referring-Image-Segmentation
A collection of papers about Referring Image Segmentation.
☆826 · Jan 28, 2026 · Updated 5 months ago
shuheikurita / RefEgo
☆13 · Jul 20, 2024 · Updated 2 years ago
FishAndWasabi / Real-LOD
Official implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"
☆34 · Apr 20, 2025 · Updated last year
FelixCaae / AlignDETR
[BMVC 2024] Official implementation of Align-DETR
☆61 · Jul 24, 2024 · Updated last year
facebookresearch / VLPart
[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation
☆395 · Sep 19, 2023 · Updated 2 years ago
wusize / CLIPSelf
[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
☆207 · Feb 5, 2024 · Updated 2 years ago
cv516Buaa / OV-VG
☆31 · Mar 25, 2024 · Updated 2 years ago
LinfengYuan1997 / LoSh
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13 · Jun 17, 2024 · Updated 2 years ago
TheShadow29 / awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
☆1,126 · Sep 21, 2025 · Updated 10 months ago