UVA-Computer-Vision-Lab/ovmono3d

[3DV 2026] Open Vocabulary Monocular 3D Object Detection

☆98

Alternatives and similar repositories for ovmono3d

Users that are interested in ovmono3d are comparing it to the libraries listed below.

Lizhuoling / UniMODE
☆52 · May 6, 2025 · Updated last year
OpenDriveLab / DetAny3D
[ICCV 2025] Detect Anything 3D in the Wild
☆284 · Dec 14, 2025 · Updated 7 months ago
PuFanqi23 / MonoDGP
[CVPR 2025] The official implementation of 'MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors'
☆97 · Aug 13, 2025 · Updated 11 months ago
UVA-Computer-Vision-Lab / LabelAny3D
[NeurIPS 2025] LabelAny3D: Label Any Object 3D in the Wild
☆130 · Jan 6, 2026 · Updated 6 months ago
NickHezhuolin / OS-Det3D
☆17 · Jun 29, 2026 · Updated 3 weeks ago
yangtiming / ImOV3D
ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)
☆94 · Feb 20, 2026 · Updated 5 months ago
sanmin0312 / LabelDistill
[ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
☆44 · Nov 1, 2024 · Updated last year
LeapLabTHU / OVM3D-Det
☆55 · Jan 2, 2025 · Updated last year
google-deepmind / omninocs
A large-scale NOCS dataset.
☆102 · Jul 12, 2024 · Updated 2 years ago
JihyeokKim / MonoDINO-DETR
MonoDINO-DETR: Depth-Enhanced Monocular 3D Object Detection Using a Vision Foundation Model
☆46 · May 27, 2025 · Updated last year
cvg / 3D-MOOD
[ICCV'25] 3D-MOOD: Lifting 2D to 3D for Monocular Open-Set Object Detection
☆123 · Oct 14, 2025 · Updated 9 months ago
robot-learning-freiburg / dualviewdistill
Bridging Perspectives: Foundation Model Guided BEV Maps for 3D Object Detection and Tracking
☆20 · Oct 13, 2025 · Updated 9 months ago
W-Ted / N3D-VLM
Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models
☆116 · Jan 14, 2026 · Updated 6 months ago
SungHunYang / MonoCLUE
[ AAAI 2026 ] The official implementation of 'MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object Detection'
☆21 · Mar 23, 2026 · Updated 3 months ago
UVA-Computer-Vision-Lab / 3d_annotator
3D BBox refinement interface used in LabelAny3D (NeurIPS 2025)
☆22 · Jan 6, 2026 · Updated 6 months ago
zhenyuw16 / Uni3DETR
Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified O…
☆119 · Jul 29, 2024 · Updated last year
lyhdet / OV-3DET
☆99 · Mar 25, 2024 · Updated 2 years ago
chreisinger / ViLGOD
Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection
☆27 · Nov 21, 2024 · Updated last year
565353780 / auto-scan2cad
☆14 · Oct 6, 2024 · Updated last year
alanzhangcs / MonoCoP
[CVPR 2026 Highlight] MonoCoP: Unleashing the Power of Chain-of-Prediction for Monocular 3D Object Detection
☆19 · Mar 31, 2026 · Updated 3 months ago
aminebdj / OpenYOLO3D
[ICLR 2025 (Oral 📢) ] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet2…
☆259 · Mar 17, 2025 · Updated last year
facebookresearch / omni3d
Code release for "Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild"
☆856 · Apr 7, 2024 · Updated 2 years ago
KuanchihHuang / VG-W3D
[ECCV2024] Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance
☆23 · Jul 14, 2024 · Updated 2 years ago
yangcaoai / CoDA_NeurIPS2023
Official code for NeurIPS2023 paper CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detec…
☆222 · May 28, 2026 · Updated last month
lucifer443 / RecurrentBEV
[ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection
☆33 · Sep 28, 2024 · Updated last year
RM-Zhang / SGCDet
[ICCV 2025] Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction
☆28 · Oct 1, 2025 · Updated 9 months ago
VDIGPKU / OpenAD
[NeurIPS 2025] OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection
☆70 · Nov 28, 2024 · Updated last year
arijitray1993 / SAT
Spatial Aptitude Training for Multimodal Language Models
☆33 · Feb 8, 2026 · Updated 5 months ago
Ruiyang-061X / LiSe
[ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.
☆40 · Sep 3, 2024 · Updated last year
Hongbin98 / MonoTTA
Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection'
☆58 · Dec 10, 2024 · Updated last year
Sense-X / GeoMIM
[ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding
☆53 · Aug 28, 2023 · Updated 2 years ago
TedLentsch / UNION
Unsupervised 3D Object Detection [NeurIPS 2024]
☆45 · Feb 12, 2026 · Updated 5 months ago
7zk1014 / PanoEnv
☆15 · Jun 21, 2026 · Updated last month
iris0329 / SeeGround
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
☆222 · Apr 21, 2025 · Updated last year
facebookresearch / boxer
Code for the Boxer research paper
☆598 · Jul 1, 2026 · Updated 2 weeks ago
AadSah / kyvo
This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026)
☆49 · Jun 11, 2025 · Updated last year
wufeim / imagenet3d
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
☆21 · Dec 6, 2024 · Updated last year
gyhandy / 3D-Copy-Paste
[NeurIPS 2023] 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection
☆58 · Mar 27, 2024 · Updated 2 years ago
ZhanYang-nwpu / Mono3DVG
[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024
☆72 · Apr 9, 2024 · Updated 2 years ago