ATR-DBI / CityRefer
☆36 Updated last year
Alternatives and similar repositories for CityRefer:
Users who are interested in CityRefer are comparing it to the libraries listed below.
- ☆40 Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language" ☆43 Updated last month
- [CVPR 2024, Highlight] Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments ☆89 Updated 8 months ago
- ☆51 Updated last year
- ☆76 Updated 2 months ago
- Official PyTorch implementation of the UrbanGIRAFFE@ICCV2023 ☆58 Updated last year
- [NeurIPS 2023] VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation ☆32 Updated 8 months ago
- ☆65 Updated last week
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding ☆43 Updated last year
- [CVPR'24] Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery ☆50 Updated 3 months ago
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models ☆52 Updated 10 months ago
- [ICLR 2025 Spotlight] Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation ☆28 Updated 2 weeks ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆85 Updated last month
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding ☆54 Updated 3 weeks ago
- SceneFun3D ToolKit ☆125 Updated last week
- [CVPR 2023] Unsupervised Continual Semantic Adaptation through Neural Rendering ☆37 Updated last year
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆105 Updated last week
- [CVPR 2023] 3D Representation Learning via Foreground Aware Feature Contrast ☆42 Updated 11 months ago
- ☆53 Updated 11 months ago
- Code Release for ECCV 2024, "PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion" ☆18 Updated 4 months ago
- [NeurIPS 2024] AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos ☆23 Updated 3 months ago
- 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks. ☆49 Updated 2 months ago
- Point Cloud Mamba: Point Cloud Learning via State Space Model ☆69 Updated 3 months ago
- ☆38 Updated 9 months ago
- Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024 ☆111 Updated 2 months ago
- ☆32 Updated 8 months ago
- ☆34 Updated 11 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model ☆65 Updated 2 months ago