dvlab-research / 3D-Box-Segment-Anything
We extend Segment Anything to 3D perception by combining it with VoxelNeXt.
☆565Updated 2 years ago
Alternatives and similar repositories for 3D-Box-Segment-Anything
Users who are interested in 3D-Box-Segment-Anything are comparing it to the repositories listed below
- VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)☆851Updated 2 years ago
- [NeurIPS'23 Spotlight] Segment Any Point Cloud Sequences by Distilling Vision Foundation Models☆637Updated last year
- [CVPR 2022] "MonoScene: Monocular 3D Semantic Scene Completion": 3D Semantic Occupancy Prediction from a single image☆791Updated last year
- Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)☆390Updated 2 years ago
- Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, or…☆868Updated 11 months ago
- ☆164Updated 2 years ago
- [ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer☆431Updated 5 months ago
- [ICCV 2023] OccNet: Scene as Occupancy☆648Updated 5 months ago
- [ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Per…☆1,005Updated 2 years ago
- [CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"☆435Updated last year
- https://arxiv.org/pdf/2202.02980☆148Updated 3 years ago
- [ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection☆398Updated 2 years ago
- [ECCV 2022 oral] Monocular 3D Object Detection with Depth from Motion☆319Updated 3 years ago
- [ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception☆682Updated last year
- CVPR2023-Occupancy-Prediction-Challenge☆856Updated 2 years ago
- Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)☆304Updated 3 years ago
- [ICCV 2023] OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction☆390Updated 2 years ago
- Code of "OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments".☆352Updated 6 months ago
- Maybe the first academic open work on stereo 3D SSC method with vision-only input.☆311Updated 2 years ago
- LargeKernel3D: Scaling up Kernels in 3D Sparse CNNs (CVPR 2023)☆216Updated 2 years ago
- Official code for BEVStereo☆278Updated 3 years ago
- ☆182Updated 2 years ago
- Official implementation of our TIV'23 paper: Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occu…☆278Updated last year
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)☆246Updated 3 years ago
- Vision-Centric BEV Perception: A Survey☆730Updated 2 years ago
- Awesome Monocular 3D detection☆436Updated last year
- Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]☆1,170Updated 2 years ago
- 2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds (ECCV 2022)☆457Updated 2 years ago
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆343Updated last year
- A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities…☆269Updated last year