[ICLR 2025 (Oral)] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on the ScanNet200 and Replica datasets with up to ∼16x speedup compared to the best existing method in the literature.
☆245 · Mar 17, 2025 · Updated last year
Alternatives and similar repositories for OpenYOLO3D
Users who are interested in OpenYOLO3D are comparing it to the repositories listed below.
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024) ☆124 · Nov 12, 2024 · Updated last year
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation ☆204 · Oct 19, 2024 · Updated last year
- [CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation ☆125 · Apr 25, 2024 · Updated 2 years ago
- ☆17 · Jul 18, 2024 · Updated last year
- ☆259 · Dec 15, 2023 · Updated 2 years ago
- [CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes ☆157 · Mar 29, 2024 · Updated 2 years ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language" ☆70 · Apr 14, 2026 · Updated 2 weeks ago
- ☆98 · Dec 29, 2024 · Updated last year
- QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization ☆21 · Nov 11, 2025 · Updated 5 months ago
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS 2024) ☆93 · Feb 20, 2026 · Updated 2 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation ☆37 · Jan 20, 2025 · Updated last year
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding ☆174 · Jul 7, 2025 · Updated 9 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024 ☆32 · Jul 18, 2024 · Updated last year
- SceneFun3D ToolKit ☆174 · Apr 17, 2025 · Updated last year
- [NeurIPS 2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of … ☆68 · Dec 3, 2023 · Updated 2 years ago
- [3DV 2026] Open Vocabulary Monocular 3D Object Detection ☆86 · Updated this week
- SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Instance Segmentation (3DV 2025) ☆162 · Apr 17, 2025 · Updated last year
- [CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies ☆815 · Oct 27, 2023 · Updated 2 years ago
- ☆30 · Jan 21, 2025 · Updated last year
- [CVPR 2024] Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships ☆155 · Sep 16, 2024 · Updated last year
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation ☆66 · Jul 29, 2024 · Updated last year
- Chain_of_Thoughts_3D_Visual_Grounding ☆20 · Apr 20, 2024 · Updated 2 years ago
- The Most Faithful Implementation of Segment Anything (SAM) in 3D ☆356 · Sep 11, 2024 · Updated last year
- Official code for NeurIPS 2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Dete… ☆220 · Apr 17, 2026 · Updated 2 weeks ago
- (CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR 2024) RegionPLC: Regional Point-Language Contrastive Learn… ☆299 · Jun 28, 2024 · Updated last year
- [ICCV 2025] SuperDec: 3D Scene Decomposition with Superquadric Primitives. ☆192 · Dec 31, 2025 · Updated 4 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries" ☆84 · Aug 2, 2024 · Updated last year
- ☆11 · Oct 29, 2024 · Updated last year
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding ☆220 · Apr 21, 2025 · Updated last year
- This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23) ☆112 · Nov 10, 2023 · Updated 2 years ago
- [ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time ☆627 · May 7, 2025 · Updated 11 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene ☆41 · Jun 9, 2025 · Updated 10 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆100 · Feb 2, 2025 · Updated last year
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception ☆124 · Mar 25, 2025 · Updated last year
- Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR 2024 Highlight] ☆1,040 · Oct 10, 2025 · Updated 6 months ago
- [RSS 2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation" ☆465 · Jan 19, 2026 · Updated 3 months ago
- [AAAI 2025] Official data and code for "TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances" ☆15 · Sep 11, 2025 · Updated 7 months ago
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024) ☆181 · Feb 27, 2026 · Updated 2 months ago
- [ECCV'2024] Gaussian Grouping for open-world Anything reconstruction, segmentation and editing. ☆989 · Jul 4, 2024 · Updated last year