π Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D
β65Jul 27, 2025Updated 8 months ago
Alternatives and similar repositories for Awesome-Open-Vocabulary-Perception
Users that are interested in Awesome-Open-Vocabulary-Perception are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π Awesome lists of papers and codes about Large Vision-Language Modelsβ13Apr 1, 2024Updated 2 years ago
- Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Deteβ¦β220Mar 19, 2026Updated last month
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for Indoor 3D Object β¦β163Mar 16, 2026Updated last month
- β98Mar 25, 2024Updated 2 years ago
- Official code for ICCV2023 paper: Learning Unified Decompositional and Compositional NeRF for Editable Novel View Synthesisβ34Dec 27, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"β14Jul 4, 2024Updated last year
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)β92Feb 20, 2026Updated last month
- Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified Oβ¦β116Jul 29, 2024Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Dataβ59Mar 25, 2024Updated 2 years ago
- [ICRA2023] D-Align: Dual Query Co-attention Network for 3D Object Detection Based on Multi-frame Point Cloud Sequenceβ15Jun 30, 2024Updated last year
- [ICLR 2026] This is the official implementation of PG-Occ: Progressive Gaussian Transformer with Anisotropy-aware Sampling for Open Vocabβ¦β33Feb 19, 2026Updated 2 months ago
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentationβ206Oct 19, 2024Updated last year
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"β63Aug 23, 2024Updated last year
- [CVPR 2025] Official code for "Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation"β65Jun 6, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Improving performance of deep learning models for 3D point cloud semantic segmentation via attention mechanismsβ18Jul 8, 2022Updated 3 years ago
- (TPAMI 2024) A Survey on Open Vocabulary Learningβ997Dec 24, 2025Updated 3 months ago
- Official Repository of Personalized Visual Instruct Tuningβ34Mar 6, 2025Updated last year
- Official implementation of CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Imagesβ21Jun 24, 2024Updated last year
- Official code for CVPR 2026 paper: VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detectionβ108Updated this week
- Papers on occupation, including monocular and multi-view in autonomous driving scenariosβ40Apr 24, 2024Updated last year
- make KITTI velodyne lidar data, label for 3d to top view coordinatesβ12Feb 5, 2018Updated 8 years ago
- [CVPR 2025] GaussHDR: High Dynamic Range Gaussian Splatting via Learning Unified 3D and 2D Local Tone Mappingβ44Oct 22, 2025Updated 5 months ago
- [ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learningβ287Aug 12, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- β104Jul 21, 2023Updated 2 years ago
- [CVPR 2024] SAI3D: Segment Any Instance in 3D Scenesβ156Mar 29, 2024Updated 2 years ago
- β11Jul 17, 2024Updated last year
- Visualizing point clouds with transparency in Switch-NeRF (ICLR2023)β13Mar 27, 2023Updated 3 years ago
- Official code for ECCV2024 paper: GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removalβ104Nov 25, 2025Updated 4 months ago
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)β179Feb 27, 2026Updated last month
- Commonsense Prototype for Outdoor Unsupervised 3D Object Detection (CVPR 2024)β78Jul 7, 2025Updated 9 months ago
- Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels. (CVPR2025)β40Dec 11, 2025Updated 4 months ago
- Official repository of SoftREPA: Aligning Text to Image in Diffusion Models is Easier Than You Thinkβ22Jun 5, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [AAAI 2024] SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detectionβ43Apr 17, 2024Updated 2 years ago
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Modelsβ191Sep 7, 2025Updated 7 months ago
- β37Oct 21, 2022Updated 3 years ago
- CoPart (ICCV 2025): A part-based 3D generation framework & the first large-scale part-level 3D dataset.β201Jul 23, 2025Updated 8 months ago
- BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation (CVPR2024)β13Jul 11, 2024Updated last year
- Code for our IEEE TIP 2020 paper "Dynamic Feature Integration for Simultaneous Detection of Salient Object, Edge and Skeleton"β52Dec 13, 2021Updated 4 years ago
- PyTorch implementation for our ICCV 2023 paper Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Objectβ¦β13May 27, 2024Updated last year