CVRP-SOLE / SOLELinks
[ICLR 2025] Official code of "Segment any 3D Object with Language"
☆46Updated 4 months ago
Alternatives and similar repositories for SOLE
Users that are interested in SOLE are comparing it to the libraries listed below
Sorting:
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024☆29Updated 10 months ago
- ☆44Updated last year
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆56Updated 3 months ago
- [3DV 2025] Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model☆86Updated last week
- Implementation of the project: SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining☆28Updated 2 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation☆34Updated 4 months ago
- ☆37Updated 10 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆28Updated 9 months ago
- ☆85Updated 5 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆93Updated 4 months ago
- [NeurIPS 2023] Weakly Supervised 3D Open-vocabulary Segmentation☆119Updated last year
- [ECCV 2024] Official implementation of "RangeLDM: Fast Realistic LiDAR Point Cloud Generation"☆34Updated 6 months ago
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆77Updated last year
- [ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects☆84Updated last year
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)☆100Updated 6 months ago
- Official PyTorch implementation of the paper ‘CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Und…☆49Updated last year
- [ICLR'25] [3D-LLM] City-scale 3D Visual Grounding with Multi-modality LLMs☆47Updated 2 weeks ago
- ☆51Updated last year
- MINSU3D: MinkowskiEngine-powered Scene Understanding in 3D☆40Updated 11 months ago
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes☆58Updated 8 months ago
- ☆34Updated last year
- [AAAI 2024] SPGroup3D: Superpoint Grouping Network for Indoor 3D Object Detection☆41Updated last year
- Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024☆29Updated 10 months ago
- Code Release for ECCV 2024, "PCF-Lift: Panoptic Lifting by Probabilistic Contrastive Fusion"☆19Updated 2 months ago
- [CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation☆99Updated last year
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models☆52Updated last year
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"☆75Updated 10 months ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆17Updated 3 months ago
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.☆94Updated last year
- [CVPR24] Depth Prompting for Sensor-Agnostic Depth Estimation☆40Updated 10 months ago