[ICLR 2025 (Oral)] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up to ∼16x speedup compared to the best existing method in the literature.
⭐258 · Mar 17, 2025 · Updated last year
Alternatives and similar repositories for OpenYOLO3D
Users who are interested in OpenYOLO3D are comparing it to the libraries listed below.
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024) ⭐133 · Nov 12, 2024 · Updated last year
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation ⭐208 · Oct 19, 2024 · Updated last year
- [CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation ⭐129 · Apr 25, 2024 · Updated 2 years ago
- [CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes ⭐161 · Mar 29, 2024 · Updated 2 years ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language" ⭐73 · Apr 14, 2026 · Updated 2 months ago
- QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization ⭐23 · Nov 11, 2025 · Updated 7 months ago
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024) ⭐94 · Feb 20, 2026 · Updated 4 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation ⭐37 · Jan 20, 2025 · Updated last year
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding ⭐177 · Jul 7, 2025 · Updated 11 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024 ⭐32 · Jul 18, 2024 · Updated last year
- SceneFun3D ToolKit ⭐176 · Apr 17, 2025 · Updated last year
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of … ⭐68 · Dec 3, 2023 · Updated 2 years ago
- [3DV 2026] Open Vocabulary Monocular 3D Object Detection ⭐95 · Apr 29, 2026 · Updated 2 months ago
- SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Instance Segmentation (3DV 2025) ⭐170 · Apr 17, 2025 · Updated last year
- [CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies ⭐832 · Oct 27, 2023 · Updated 2 years ago
- [CVPR 2024] Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships ⭐165 · Sep 16, 2024 · Updated last year
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation ⭐66 · Jul 29, 2024 · Updated last year
- Chain_of_Thoughts_3D_Visual_Grounding ⭐21 · Apr 20, 2024 · Updated 2 years ago
- The Most Faithful Implementation of Segment Anything (SAM) in 3D ⭐358 · Sep 11, 2024 · Updated last year
- Official code for NeurIPS2023 paper CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detec… ⭐222 · May 28, 2026 · Updated last month
- (CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learn… ⭐301 · Jun 28, 2024 · Updated 2 years ago
- [ICCV 2025] SuperDec: 3D Scene Decomposition with ✨Superquadric Primitives. ⭐201 · Dec 31, 2025 · Updated 6 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries" ⭐85 · Aug 2, 2024 · Updated last year
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding ⭐222 · Apr 21, 2025 · Updated last year
- This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23) ⭐113 · Nov 10, 2023 · Updated 2 years ago
- [ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time ⭐634 · May 7, 2025 · Updated last year
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ⭐102 · Feb 2, 2025 · Updated last year
- Open-Vocabulary SAM3D: Understand Any 3D Scene ⭐44 · Jun 9, 2025 · Updated last year
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception ⭐124 · Mar 25, 2025 · Updated last year
- Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight] ⭐1,064 · Oct 10, 2025 · Updated 8 months ago
- [RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation" ⭐499 · Jan 19, 2026 · Updated 5 months ago
- [AAAI 2025] Official data and code for "TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances" ⭐15 · Sep 11, 2025 · Updated 9 months ago
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024) ⭐177 · Feb 27, 2026 · Updated 4 months ago
- [ECCV'2024] Gaussian Grouping for open-world Anything reconstruction, segmentation and editing. ⭐1,019 · Jul 4, 2024 · Updated 2 years ago