[CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning"
☆129Jan 30, 2026Updated last month
Alternatives and similar repositories for Inst3D-LMM
Users that are interested in Inst3D-LMM are comparing it to the libraries listed below
Sorting:
- The official implementation of "Label-efficient Semantic Scene Completion with Scribble Annotations" (IJCAI 2024)☆14Jul 27, 2024Updated last year
- The official implementation of "Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation" (CVPR 2024)☆28Jul 27, 2024Updated last year
- [CVPR2025] The code for "Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction."☆21Oct 19, 2025Updated 4 months ago
- Unifying 2D and 3D Vision-Language Understanding☆121Jul 23, 2025Updated 7 months ago
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025☆15Dec 25, 2025Updated 2 months ago
- ☆75Mar 29, 2025Updated 11 months ago
- The code for "Label-efficient Segmentation via Affinity Propagation". [NeurIPS2023]☆67Mar 4, 2024Updated 2 years ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆31Jul 18, 2024Updated last year
- The code for PixelRefer & VideoRefer☆346Nov 16, 2025Updated 3 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆81Oct 10, 2024Updated last year
- [CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"☆117Apr 13, 2024Updated last year
- The official implementation of "LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation" (CVPR 20…☆91Apr 6, 2024Updated last year
- The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025☆277May 26, 2025Updated 9 months ago
- 🌐 A Roadmap for 3D Scene Understanding in the Wild☆23Dec 19, 2025Updated 2 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆373Oct 21, 2025Updated 4 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆118May 30, 2025Updated 9 months ago
- Official pytorch implementation of "XHand: Real-time Expressive Hand Avatar"☆82Jul 31, 2024Updated last year
- [NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆151Dec 9, 2025Updated 3 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆67Jul 22, 2025Updated 7 months ago
- [NeurIPS 2024 Oral] RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation