π Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D
β67Jul 27, 2025Updated 10 months ago
Alternatives and similar repositories for Awesome-Open-Vocabulary-Perception
Users that are interested in Awesome-Open-Vocabulary-Perception are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for NeurIPS2023 paper CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detecβ¦β220Updated this week
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object β¦β164Mar 16, 2026Updated 2 months ago
- β98Mar 25, 2024Updated 2 years ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"β14Jul 4, 2024Updated last year
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)β94Feb 20, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified Oβ¦β118Jul 29, 2024Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Dataβ61Mar 25, 2024Updated 2 years ago
- β13Jun 4, 2025Updated 11 months ago
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentationβ206Oct 19, 2024Updated last year
- [ICLR 2026] This is the official implementation of PG-Occ: Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabβ¦β33Feb 19, 2026Updated 3 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"β63Aug 23, 2024Updated last year
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"β65Jun 6, 2025Updated 11 months ago
- Improving performance of deep learning models for 3D point cloud semantic segmentation via attention mechanismsβ18Jul 8, 2022Updated 3 years ago
- β34Apr 4, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- (TPAMI 2024) A Survey on Open Vocabulary Learningβ999May 12, 2026Updated 2 weeks ago
- Official Repository of Personalized Visual Instruct Tuningβ34Mar 6, 2025Updated last year
- Official code for CVPR 2026 paper: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detectionβ124Apr 14, 2026Updated last month
- Papers on occupation, including monocular and multi-view in autonomous driving scenariosβ40Apr 24, 2024Updated 2 years ago
- make KITTI velodyne lidar data, label for 3d to top view coordinatesβ12Feb 5, 2018Updated 8 years ago
- [CVPR 2025] GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mappingβ47Oct 22, 2025Updated 7 months ago
- β11Jul 17, 2024Updated last year
- [CVPR 2024] SAI3D: Segment Any Instance in 3D Scenesβ159Mar 29, 2024Updated 2 years ago
- Visualizing point clouds with transparency in Switch-NeRF (ICLR2023)β13Mar 27, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for ECCV2024 paper: GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removalβ104Nov 25, 2025Updated 6 months ago
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)β177Feb 27, 2026Updated 3 months ago
- One4D: Unified 4D Generation and Reconstructionβ95Dec 2, 2025Updated 5 months ago
- Official repository of SoftREPA: Aligning Text to Image in Diffusion Models is Easier Than You Thinkβ23Jun 5, 2025Updated 11 months ago
- Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels. (CVPR2025)β41Dec 11, 2025Updated 5 months ago
- [AAAI 2024] SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detectionβ43Apr 17, 2024Updated 2 years ago
- Commonsense Prototype for Outdoor Unsupervised 3D Object Detection (CVPR 2024)β79Jul 7, 2025Updated 10 months ago
- [ICCV 2025] Language Driven Occupancy Predictionβ39Dec 23, 2024Updated last year
- EfficientSAM + YOLO World base model for use with Autodistill.β10Feb 21, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation (CVPR2024)β14Jul 11, 2024Updated last year
- Code for our IEEE TIP 2020 paper "Dynamic Feature Integration for Simultaneous Detection of Salient Object, Edge and Skeleton"β52Dec 13, 2021Updated 4 years ago
- PyTorch implementation for our ICCV 2023 paper Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Objectβ¦β13May 27, 2024Updated 2 years ago
- [CVPR'24] "Unsupervised Occupancy Learning from Sparse Point Cloud"β16Sep 25, 2024Updated last year
- [ICRA2024] FG-PFE: Fine-Grained Pillar Feature Encoding via Spatio-Temporal Virtual Grid for 3D Object Detectionβ14Feb 5, 2024Updated 2 years ago
- Transformation-Equivariant 3D Object Detection for Autonomous Drivingβ187May 3, 2024Updated 2 years ago
- The code release for "Variational Structured Attention Networks for Visual Dense Representation Learning"β14Nov 28, 2022Updated 3 years ago