yangtiming/ImOV3D

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yangtiming/ImOV3D)

yangtiming / ImOV3D

ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)

☆94

Alternatives and similar repositories for ImOV3D

Users that are interested in ImOV3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lyhdet / OV-3DET
View on GitHub
☆99Mar 25, 2024Updated 2 years ago
yangcaoai / 3DGS-DET
View on GitHub
Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object …
☆165Mar 16, 2026Updated 4 months ago
zhenyuw16 / Uni3DETR
View on GitHub
Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified O…
☆119Jul 29, 2024Updated last year
yangcaoai / Awesome-Open-Vocabulary-Perception
View on GitHub
😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D
☆64Jul 27, 2025Updated 11 months ago
yangcaoai / CoDA_NeurIPS2023
View on GitHub
Official code for NeurIPS2023 paper CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detec…
☆222May 28, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mkt1412 / FUNCTO_public
View on GitHub
Code implementation of MimicFunc
☆27Aug 8, 2025Updated 11 months ago
UVA-Computer-Vision-Lab / ovmono3d
View on GitHub
[3DV 2026] Open Vocabulary Monocular 3D Object Detection
☆98Apr 29, 2026Updated 2 months ago
aminebdj / OpenYOLO3D
View on GitHub
[ICLR 2025 (Oral 📢) ] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet2…
☆259Mar 17, 2025Updated last year
Pixie8888 / MVSDet
View on GitHub
Code for NeurIPS 2024 work "MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps"
☆17Dec 11, 2024Updated last year
liy1shu / FlowBotHD
View on GitHub
FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
☆13Dec 13, 2024Updated last year
GradiusTwinbee / GLIS
View on GitHub
officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"
☆14Jul 4, 2024Updated 2 years ago
cxy1997 / DiffuBox
View on GitHub
DiffuBox: Refining 3D Object Detection with Point Diffusion
☆22Mar 9, 2025Updated last year
gwenzhang / GGA
View on GitHub
[ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…
☆36Jul 26, 2024Updated last year
SerCharles / CN-RMA
View on GitHub
Official implementation of CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images
☆21Jun 24, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yjtang249 / OnlineAnySeg
View on GitHub
[CVPR 2025] OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging
☆45Jun 6, 2025Updated last year
LeapLabTHU / OVM3D-Det
View on GitHub
☆55Jan 2, 2025Updated last year
kaustpradalab / LLM-Persona-Steering
View on GitHub
Official code of "Exploring the Personality Traits of LLMs through Latent Features Steering"
☆18Jan 30, 2025Updated last year
facebookresearch / efm3d
View on GitHub
This is the official release for the paper "EFM3D A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models" (https//arx…
☆186Mar 4, 2026Updated 4 months ago
VinAIResearch / Open3DIS
View on GitHub
Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)
☆135Nov 12, 2024Updated last year
yd-yin / SAI3D
View on GitHub
[CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes
☆162Mar 29, 2024Updated 2 years ago
mwatkins1970 / SAE_Feature_Interpretability_Tool
View on GitHub
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…
☆19Oct 4, 2024Updated last year
ZhenyangLiu / ReasonGrounder
View on GitHub
☆15Jul 11, 2025Updated last year
ymingxie / PARQ
View on GitHub
Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23)
☆45Oct 19, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
GAP-LAB-CUHK-SZ / SAMPro3D
View on GitHub
SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Instance Segmentation (3DV 2025)
☆171Apr 17, 2025Updated last year
zju3dv / BoxDreamer
View on GitHub
Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.
☆108Oct 6, 2025Updated 9 months ago
TEA-Lab / Robo-ABC
View on GitHub
[ECCV 2024] 🎉 Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipu…
☆101Nov 26, 2024Updated last year
hahamyt / clickattention
View on GitHub
ClickAttention: Click Region Similarity Guided Interactive Segmentation
☆23Jan 3, 2025Updated last year
Ghostish / ObjectCentricOccCompletion
View on GitHub
Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2…
☆30Apr 20, 2025Updated last year
Pointcept / OpenIns3D
View on GitHub
[ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
☆207Oct 19, 2024Updated last year
wangzy22 / XMask3D
View on GitHub
[NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
☆37Jan 20, 2025Updated last year
fudan-zvg / UniUGG
View on GitHub
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.
☆63Updated this week
ShenzheZhu / JailDAM
View on GitHub
[COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model
☆26Nov 25, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ayushjain1144 / odin
View on GitHub
Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)
☆177Feb 27, 2026Updated 4 months ago
AadSah / kyvo
View on GitHub
This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026)
☆49Jun 11, 2025Updated last year
zyc00 / PartSLIP2
View on GitHub
☆50May 18, 2024Updated 2 years ago
xuxw98 / ESAM
View on GitHub
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
☆634May 7, 2025Updated last year
lslrh / DMA
View on GitHub
Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024
☆32Jul 18, 2024Updated 2 years ago
eslambakr / CoT3D_VG
View on GitHub
Chain_of_Thoughts_3D_Visual_Grounding
☆21Apr 20, 2024Updated 2 years ago
taco-group / AutoTrust
View on GitHub
[TMLR'25] AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public saf…
☆54Nov 20, 2025Updated 8 months ago