xuxw98 / ESAM
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
☆296 Updated this week
Alternatives and similar repositories for ESAM:
Users who are interested in ESAM are comparing it to the libraries listed below
- The Most Faithful Implementation of Segment Anything (SAM) in 3D ☆297 Updated 5 months ago
- Official implementation of "DepthLab: From Partial to Complete" ☆434 Updated this week
- [ICCV 2023 Oral] Pytorch Implementation ☆88 Updated last year
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception ☆110 Updated 4 months ago
- [ECCV 2024] SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM ☆167 Updated last month
- Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning ☆131 Updated 3 weeks ago
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation ☆173 Updated 3 months ago
- The implementation of SUNDAE: Spectrally Pruned Gaussian Fields with Neural Compensation ☆162 Updated 8 months ago
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding ☆129 Updated 2 months ago
- (CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learn… ☆270 Updated 7 months ago
- Monocular Depth Estimation Toolbox and Benchmark. [Arxiv'24 ScaleDepth, TCSVT'24 Plane2Depth, TIP'24 Binsformer] ☆73 Updated 3 weeks ago
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu… ☆263 Updated 6 months ago
- [ICLR 2025] Point-SAM: Promptable 3D Segmentation Model for Point Clouds ☆170 Updated last month
- Align 3D Point Cloud with Multi-modalities for Large Language Models ☆420 Updated last year
- [SCIS] SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model ☆202 Updated last year
- Official Codebase of "DiffComplete: Diffusion-based Generative 3D Shape Completion" ☆83 Updated 6 months ago
- ☆135 Updated last month
- [CVPR 2023] Learning 3D Representations from 2D Pre-trained Models via Image-to-Point Masked Autoencoders ☆218 Updated last year
- [ICRA 2023] From Semi-supervised to Omni-supervised Room Layout Estimation Using Point Clouds ☆105 Updated 2 years ago
- [CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation ☆88 Updated 9 months ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding" ☆223 Updated last month
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding" ☆128 Updated 2 months ago
- [ICCV 2023] PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment ☆670 Updated 3 months ago
- Official implementation for SlimmeRF: Slimmable Radiance Fields (3DV 2024 Best Paper) ☆145 Updated 7 months ago
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024) ☆87 Updated 3 months ago
- [ICCV23] DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection ☆86 Updated last year
- SceneTracker: Long-term Scene Flow Estimation Network ☆99 Updated 7 months ago
- [ICCV 2023] PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning ☆244 Updated last year
- Reasoning 3D Segmentation - "segment anything"/grounding/part separation in 3D with natural conversations. ☆76 Updated 8 months ago
- [ARXIV'23] PanopticNeRF-360 | [3DV'22] Panoptic NeRF ☆215 Updated 2 months ago