yangcaoai / Awesome-Open-Vocabulary-PerceptionView external linksLinks
π Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D
β64Jul 27, 2025Updated 6 months ago
Alternatives and similar repositories for Awesome-Open-Vocabulary-Perception
Users that are interested in Awesome-Open-Vocabulary-Perception are comparing it to the libraries listed below
Sorting:
- Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Deteβ¦β220Sep 10, 2025Updated 5 months ago
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detectiβ¦β161Oct 21, 2025Updated 3 months ago
- β98Mar 25, 2024Updated last year
- Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesisβ34Dec 27, 2023Updated 2 years ago
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)β88Oct 2, 2025Updated 4 months ago
- Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified Oβ¦β116Jul 29, 2024Updated last year
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"β14Jul 4, 2024Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Dataβ59Mar 25, 2024Updated last year
- β56Mar 6, 2025Updated 11 months ago
- EfficientSAM + YOLO World base model for use with Autodistill.β10Feb 21, 2024Updated last year
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"β60Aug 23, 2024Updated last year
- β35Apr 4, 2024Updated last year
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detecβ¦β66Apr 4, 2025Updated 10 months ago
- [ICRA2023] D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequenceβ15Jun 30, 2024Updated last year
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentationβ205Oct 19, 2024Updated last year
- Official implementation of CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Imagesβ19Jun 24, 2024Updated last year
- DiffuBox: Refining 3D Object Detection with Point Diffusionβ20Mar 9, 2025Updated 11 months ago
- [CVPR 2025] Learning Class Prototypes for Unified Sparse Supervised 3D Object Detectionβ26Apr 28, 2025Updated 9 months ago
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"β66Jun 6, 2025Updated 8 months ago
- [AAAI 2022 Oral] Static-Dynamic Co-Teaching for Class-Incremental 3D Object Detectionβ25Nov 22, 2022Updated 3 years ago
- [ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learningβ284Aug 12, 2025Updated 6 months ago
- β65Jan 15, 2026Updated last month
- Official Repository of Personalized Visual Instruct Tuningβ34Mar 6, 2025Updated 11 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scrollβ¦β27May 16, 2024Updated last year
- Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels. (CVPR2025)β36Dec 11, 2025Updated 2 months ago
- [ECCV 2024] π Official repository of "Robo-ABC: Affordance Generalization Beyond Categories via Semantic Correspondence for Robot Manipuβ¦β96Nov 26, 2024Updated last year
- Code for our IEEE TIP 2020 paper "Dynamic Feature Integration for Simultaneous Detection of Salient Object, Edge and Skeleton"β52Dec 13, 2021Updated 4 years ago
- Official code for CFNetβ26May 17, 2024Updated last year
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)β179Oct 27, 2025Updated 3 months ago
- β44May 10, 2025Updated 9 months ago
- Transformation-Equivariant 3D Object Detection for Autonomous Drivingβ186May 3, 2024Updated last year
- Zero-shot RGB-D Point Cloud Registration with Pre-trained Large Vision Modelβ17Mar 15, 2025Updated 11 months ago
- Commonsense Prototype for Outdoor Unsupervised 3D Object Detection (CVPR 2024)β75Jul 7, 2025Updated 7 months ago
- Our 2nd-gen LMMβ34May 22, 2024Updated last year
- [CVPR 2024] SAI3D: Segment Any Instance in 3D Scenesβ153Mar 29, 2024Updated last year
- Open-vocabulary Semantic Segmentationβ33Feb 16, 2024Updated 2 years ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraintsβ44Jul 2, 2025Updated 7 months ago
- νλμ½λ©μΌλ‘ μμ£Όμμ£Ό κ°λ¨ν μ±λ΄β10May 25, 2018Updated 7 years ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024β31Jul 18, 2024Updated last year