Divadi / SOLOFusion
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
☆252Updated 2 years ago
Alternatives and similar repositories for SOLOFusion:
Users that are interested in SOLOFusion are comparing it to the libraries listed below
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆168Updated 2 years ago
- Official code for BEVStereo☆266Updated 2 years ago
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)☆233Updated 2 years ago
- Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer☆235Updated 2 years ago
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆165Updated last year
- 3D occupancy☆379Updated 2 years ago
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆308Updated 6 months ago
- [NeurIPS 2022] DeepInteraction: 3D Object Detection via Modality Interaction☆233Updated last month
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection☆106Updated last year
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection☆220Updated 7 months ago
- Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"☆97Updated last year
- Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)☆300Updated 2 years ago
- [IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation☆141Updated 9 months ago
- Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy☆233Updated last year
- [ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning☆180Updated 7 months ago
- [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection☆121Updated last year
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)☆183Updated 8 months ago
- Code for Paper, MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries. https://tsinghua-mars-lab.github.io/mutr3d/☆190Updated 2 years ago
- Target Inner-Geometry Learning for BEV 3D Object Detection☆89Updated last year
- [CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation☆163Updated 7 months ago
- Implementation of PF-Track☆221Updated last year
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection☆121Updated last year
- (NeurlPS 2022) Towards Efficient 3D Object Detection with Knowledge Distillation☆119Updated last year
- [ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction☆138Updated 6 months ago
- ☆218Updated last year
- EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection☆269Updated last year
- [ECCV 2022] Learning Ego 3D Representation as Ray Tracing☆105Updated 2 years ago
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving☆223Updated last year
- Open Source 3D Occupancy Prediction Library.☆143Updated 2 years ago
- MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer (CVPR 2022)☆146Updated 4 months ago