dvlab-research / GroupContrast
[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
☆42 Updated 6 months ago
Related projects:
- Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model ☆39 Updated 3 months ago
- Official code of "Segment any 3D Object with Language" ☆35 Updated 4 months ago
- Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆22 Updated last week
- Official implementation for [3DV 2024] `Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding` ☆43 Updated 2 months ago
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners ☆34 Updated this week
- Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding ☆42 Updated 5 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation ☆102 Updated 8 months ago
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024) ☆64 Updated last month
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models ☆42 Updated 4 months ago
- M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts. Furthermore, M3DBe… ☆54 Updated 9 months ago
- Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation" ☆59 Updated last year
- Open-Vocabulary SAM3D: Understand Any 3D Scene ☆24 Updated 2 weeks ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding ☆62 Updated 5 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra… ☆72 Updated last month
- [MM 2024] [Need a RTX 3090] MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors ☆60 Updated last week
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆13 Updated 2 months ago
- [ECCV2024] Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding ☆91 Updated 2 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities ☆43 Updated 2 months ago
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆84 Updated 4 months ago
- X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition (CVPR2024) ☆18 Updated 5 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024 ☆22 Updated 2 months ago
- Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23) ☆35 Updated 11 months ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training ☆47 Updated last year
- PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis ☆39 Updated 3 months ago
- [AAAI 2024] SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection ☆34 Updated 5 months ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆36 Updated last month