Ivan-Tang-3D / ViewRefer3D
(ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'
☆56 · Updated 7 months ago
Related projects
Alternatives and complementary repositories for ViewRefer3D
- Reasoning 3D Segmentation - "segment anything"/grounding/part separation in 3D with natural conversations. ☆75 · Updated 5 months ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation ☆117 · Updated 7 months ago
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation ☆77 · Updated 2 months ago
- [ICCV23] DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection ☆91 · Updated 10 months ago
- [NeurIPS 2022] TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation ☆122 · Updated last year
- Official Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion" ☆84 · Updated 3 months ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation" ☆97 · Updated last year
- Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning ☆117 · Updated 3 weeks ago
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models ☆85 · Updated 7 months ago
- [ICRA 2023] From Semi-supervised to Omni-supervised Room Layout Estimation Using Point Clouds ☆111 · Updated last year
- [ICRA 2023] ConDA: Unsupervised Domain Adaptation for LiDAR Segmentation via Regularized Domain Concatenation ☆59 · Updated last year
- [ICCV 2023 Oral] PyTorch Implementation ☆98 · Updated last year
- SceneTracker: Long-term Scene Flow Estimation Network ☆105 · Updated 4 months ago
- Monocular Depth Estimation Toolbox and Benchmark. [Arxiv'24 ScaleDepth, TIP'24 Binsformer] ☆68 · Updated 2 weeks ago
- The official generation code and toolkits of VDW dataset (ICCV 2023) ☆43 · Updated 4 months ago
- EmbodiedSAM: Online Segment Any 3D Thing in Real Time ☆224 · Updated 2 weeks ago
- This is the official PyTorch implementation of our paper "PointNorm: Normalization is All You Need for Point Cloud Analysis" ☆56 · Updated last year
- Code release for "UniVS: Unified and Universal Video Segmentation with Prompts as Queries" (CVPR2024) ☆170 · Updated 4 months ago
- [CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders ☆218 · Updated last year
- [ECCV2022] Learning Quality-aware Dynamic Memory for Video Object Segmentation ☆142 · Updated last year
- [NeurIPS 2023] DynPoint: Dynamic Neural Point for View Synthesis ☆72 · Updated 9 months ago
- [NeurIPS 2022] Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training ☆205 · Updated last year
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding ☆101 · Updated last week
- Code of 3DMIT: 3D MULTI-MODAL INSTRUCTION TUNING FOR SCENE UNDERSTANDING ☆24 · Updated 3 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆44 · Updated 3 months ago
- Official implementation of Bootstrap3D: Improving Multi-view Diffusion Model with Synthetic Data ☆84 · Updated 3 weeks ago
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding" ☆19 · Updated last year