[ICLR 2025 (Oral π’) ] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up βΌ16x speedup compared to the best existing method in literature.
β243Mar 17, 2025Updated last year
Alternatives and similar repositories for OpenYOLO3D
Users that are interested in OpenYOLO3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)β121Nov 12, 2024Updated last year
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentationβ207Oct 19, 2024Updated last year
- [CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentationβ122Apr 25, 2024Updated last year
- β17Jul 18, 2024Updated last year
- β258Dec 15, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2024] SAI3D: Segment Any Instance in 3D Scenesβ156Mar 29, 2024Updated 2 years ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language"β71Apr 1, 2026Updated last week
- β98Dec 29, 2024Updated last year
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)β92Feb 20, 2026Updated last month
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentationβ37Jan 20, 2025Updated last year
- [NeurIPS 2024] A Unified Framework for 3D Scene Understandingβ175Jul 7, 2025Updated 9 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024β32Jul 18, 2024Updated last year
- SceneFun3D ToolKitβ169Apr 17, 2025Updated 11 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of β¦β68Dec 3, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [3DV 2026] Open Vocabulary Monocular 3D Object Detectionβ86Nov 25, 2025Updated 4 months ago
- [CVPR 2024] Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationshipsβ151Sep 16, 2024Updated last year
- [CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabulariesβ810Oct 27, 2023Updated 2 years ago
- β31Jan 21, 2025Updated last year
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentationβ66Jul 29, 2024Updated last year
- SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Instance Segmentation (3DV 2025)β160Apr 17, 2025Updated 11 months ago
- Chain_of_Thoughts_3D_Visual_Groundingβ20Apr 20, 2024Updated last year
- The Most Faithful Implementation of Segment Anything (SAM) in 3Dβ355Sep 11, 2024Updated last year
- Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Deteβ¦β220Mar 19, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- (CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learnβ¦β300Jun 28, 2024Updated last year
- [ICCV 2025] SuperDec: 3D Scene Decomposition with β¨Superquadric Primitives.β189Dec 31, 2025Updated 3 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"β84Aug 2, 2024Updated last year
- β11Oct 29, 2024Updated last year
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Groundingβ216Apr 21, 2025Updated 11 months ago
- This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23)β112Nov 10, 2023Updated 2 years ago
- [ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Timeβ628May 7, 2025Updated 11 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Sceneβ41Jun 9, 2025Updated 10 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understandingβ100Feb 2, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]β1,022Oct 10, 2025Updated 6 months ago
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perceptionβ125Mar 25, 2025Updated last year
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)β179Feb 27, 2026Updated last month
- [ICCV2025] All in One: Visual-Description-Guided Unified Point Cloud Segmentationβ29Jul 25, 2025Updated 8 months ago
- [AAAI 2025] Official data and code for "TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances"β15Sep 11, 2025Updated 7 months ago
- [RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"β451Jan 19, 2026Updated 2 months ago
- [ECCV'2024] Gaussian Grouping for open-world Anything reconstruction, segmentation and editing.β979Jul 4, 2024Updated last year