peoplelu / BSNetLinks
BSNet: Box-Supervised Simulation-assisted Mean Teacher for 3D Instance Segmentation (CVPR2024)
☆13Updated last year
Alternatives and similar repositories for BSNet
Users that are interested in BSNet are comparing it to the libraries listed below
Sorting:
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆208Updated 9 months ago
- 3DV 2026 | CVPRW 2025 (T4V)☆89Updated 2 weeks ago
- [ICLR 2025 (Oral 📢) ] Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet2…☆237Updated 10 months ago
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception☆125Updated 10 months ago
- ☆38Updated 6 months ago
- PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation☆189Updated last year
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding☆170Updated 7 months ago
- [CVPR 2025] OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging☆40Updated 8 months ago
- [IROS 25] Dynamic 3D Gaussian Scene Graphs for Environment Adaptation☆71Updated last month
- [CVPR 2025] PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding☆95Updated 7 months ago
- Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)☆115Updated last year
- [CVPR2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos☆190Updated 4 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆205Updated last month
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding☆70Updated last year
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes☆66Updated last year
- [CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation☆121Updated last year
- [3DV 2026] Open Vocabulary Monocular 3D Object Detection☆80Updated 2 months ago
- [ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining☆305Updated last month
- [CVPR 2025] Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera☆294Updated 4 months ago
- [ICCV2025] Extrapolated Urban View Synthesis Benchmark☆47Updated 4 months ago
- LitePT: Lighter Yet Stronger Point Transformer☆205Updated last month
- [ICCV 2025] Detect Anything 3D in the Wild☆246Updated last month
- SLAM-Former: Putting SLAM into One Transformer☆418Updated 4 months ago
- Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)☆75Updated last month
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆63Updated 6 months ago
- [ICLR 2026] Streaming 4D Visual Geometry Transformer☆821Updated 3 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆334Updated 5 months ago
- [AAAI2025] UniDet3D: Multi-dataset Indoor 3D Object Detection☆163Updated 8 months ago
- [ICCV 2025] 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.☆103Updated last month
- Official implemetation of the paper "Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting".☆246Updated last year