Megvii-BaseDetection / BEVStereoLinks
Official code for BEVStereo
☆278Updated 3 years ago
Alternatives and similar repositories for BEVStereo
Users that are interested in BEVStereo are comparing it to the libraries listed below
Sorting:
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆267Updated 2 years ago
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆176Updated 2 years ago
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)☆246Updated 3 years ago
- 3D occupancy☆397Updated 2 years ago
- Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer☆245Updated 2 years ago
- Code for Paper, MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries. https://tsinghua-mars-lab.github.io/mutr3d/☆204Updated 3 years ago
- Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy☆254Updated 2 years ago
- [NeurIPS 2022 & TPAMI 2025] DeepInteraction: 3D Object Detection via Modality Interaction☆249Updated 8 months ago
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving☆234Updated last year
- https://arxiv.org/pdf/2202.02980☆149Updated 3 years ago
- [CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation☆183Updated last year
- [ICCV 2023] OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction☆389Updated 2 years ago
- Maybe the first academic open work on stereo 3D SSC method with vision-only input.☆311Updated 2 years ago
- MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer (CVPR 2022)☆153Updated last year
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection☆111Updated 2 years ago
- [T-PAMI 2022] DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors☆91Updated 2 years ago
- [ECCV 2022 oral] Monocular 3D Object Detection with Depth from Motion☆318Updated 3 years ago
- [ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning☆204Updated 3 months ago
- Delving into Localization Errors for Monocular 3D Object Detection, CVPR'2021☆179Updated 4 years ago
- Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)☆304Updated 3 years ago
- [CVPR2021] PointAugmenting: Cross-Modal Augmentation for 3D Object Detection☆116Updated 3 years ago
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆193Updated 2 years ago
- An official code release of our CVPR'23 paper, BEVHeight☆226Updated last year
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆344Updated last year
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection☆129Updated 2 years ago
- [ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection☆398Updated 2 years ago
- [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection☆136Updated 2 years ago
- [IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation☆162Updated 8 months ago
- [CoRL 2022] SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation☆286Updated 2 years ago
- [ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction☆152Updated last year