eslambakr / CoT3D_VG
Chain_of_Thoughts_3D_Visual_Grounding
☆19 Updated last year
Alternatives and similar repositories for CoT3D_VG
Users who are interested in CoT3D_VG are comparing it to the repositories listed below.
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024 ☆31 Updated last year
- ☆63 Updated 2 years ago
- [TNNLS] Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases ☆15 Updated 6 months ago
- This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023). ☆29 Updated 2 years ago
- ☆96 Updated last year
- Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified O… ☆115 Updated last year
- PyTorch implementation for our ICCV 2023 paper Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object… ☆13 Updated last year
- [AAAI 2023 Oral] Language-Assisted 3D Feature Learning for Semantic Scene Understanding ☆12 Updated 2 years ago
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding" ☆25 Updated 2 years ago
- [AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024 ☆65 Updated last year
- ☆25 Updated 3 years ago
- [ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects ☆93 Updated 3 months ago
- [ICCV2025] All in One: Visual-Description-Guided Unified Point Cloud Segmentation ☆27 Updated 5 months ago
- [ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds ☆43 Updated 3 years ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆61 Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language" ☆53 Updated 3 months ago
- [ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding ☆44 Updated 3 years ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene ☆36 Updated 7 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model ☆116 Updated 7 months ago
- ☆49 Updated 2 years ago
- [CVPR'24] MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding ☆18 Updated last year
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024 ☆30 Updated last year
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection" ☆14 Updated last year
- [ECCV 2024] The official PyTorch implementation of the "Part2Object: Hierarchical Unsupervised 3D Instance Segmentation". ☆24 Updated last year
- [CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding ☆131 Updated 2 years ago
- [CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds ☆56 Updated 2 years ago
- ☆21 Updated 9 months ago
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries" ☆83 Updated last year
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models ☆57 Updated last year
- [ICCV 2023] Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models ☆44 Updated last year