Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)
☆121 · Nov 12, 2024 · Updated last year
Alternatives and similar repositories for Open3DIS
Users interested in Open3DIS are comparing it to the libraries listed below.
- [ICLR 2025 (Oral)] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet2… ☆240 · Mar 17, 2025 · Updated last year
- [CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes ☆155 · Mar 29, 2024 · Updated last year
- Open-Vocabulary SAM3D: Understand Any 3D Scene ☆40 · Jun 9, 2025 · Updated 9 months ago
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation ☆206 · Oct 19, 2024 · Updated last year
- [CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation ☆122 · Apr 25, 2024 · Updated last year
- ☆255 · Dec 15, 2023 · Updated 2 years ago
- ☆97 · Dec 29, 2024 · Updated last year
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries" ☆84 · Aug 2, 2024 · Updated last year
- Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D. ☆717 · Oct 29, 2023 · Updated 2 years ago
- [CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies ☆801 · Oct 27, 2023 · Updated 2 years ago
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding ☆173 · Jul 7, 2025 · Updated 8 months ago
- This is the official repository for OVIR-3D: Open-Vocabulary 3D Instance Retrieval Without Training on 3D Data. (CoRL'23) ☆112 · Nov 10, 2023 · Updated 2 years ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024 ☆31 · Jul 18, 2024 · Updated last year
- Official code for the ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection" ☆14 · Jul 4, 2024 · Updated last year
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024) ☆90 · Feb 20, 2026 · Updated last month
- (CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learn… ☆298 · Jun 28, 2024 · Updated last year
- Chain_of_Thoughts_3D_Visual_Grounding ☆19 · Apr 20, 2024 · Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language" ☆71 · Oct 11, 2025 · Updated 5 months ago
- ISBNet: a 3D Point Cloud Instance Segmentation Network with Instance-aware Sampling and Box-aware Dynamic Convolution (CVPR 2023) ☆161 · Nov 12, 2024 · Updated last year
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024) ☆179 · Feb 27, 2026 · Updated 3 weeks ago
- [ACM MM-2024] RefMask3D: Language-Guided Transformer for 3D Referring Segmentation ☆66 · Jul 29, 2024 · Updated last year
- [ICCV 2025] SAS: Segment Any 3D Scene with Integrated 2D Priors ☆31 · Jun 25, 2025 · Updated 8 months ago
- MINSU3D: MinkowskiEngine-powered Scene Understanding in 3D ☆42 · Jun 24, 2024 · Updated last year
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding" ☆278 · Mar 19, 2025 · Updated last year
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation ☆36 · Jan 20, 2025 · Updated last year
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners ☆51 · Dec 4, 2025 · Updated 3 months ago
- SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Instance Segmentation (3DV 2025) ☆159 · Apr 17, 2025 · Updated 11 months ago
- Official repository of the paper: Masked Scene Modeling (CVPR 2025) ☆17 · Dec 13, 2025 · Updated 3 months ago
- ☆98 · Mar 25, 2024 · Updated last year
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation ☆125 · Jan 11, 2024 · Updated 2 years ago
- ☆49 · Oct 27, 2023 · Updated 2 years ago
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception ☆125 · Mar 25, 2025 · Updated 11 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities ☆81 · Oct 10, 2024 · Updated last year
- Official implementation of the paper "Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting". ☆249 · Aug 29, 2024 · Updated last year
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu… ☆312 · Jul 17, 2024 · Updated last year
- PyTorch implementation of "Efficiently Reconstructing Dynamic Scenes One 🎯 D4RT at a Time" ☆48 · Jan 27, 2026 · Updated last month
- ☆10 · Oct 18, 2024 · Updated last year
- Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments ☆12 · Nov 29, 2021 · Updated 4 years ago
- A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds ☆29 · Jan 19, 2025 · Updated last year