fudan-zvg / PolarFormer
[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers
☆174 Updated 2 years ago
Alternatives and similar repositories for PolarFormer
Users who are interested in PolarFormer are comparing it to the repositories listed below.
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection ☆263 Updated 2 years ago
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022) ☆243 Updated 2 years ago
- [NeurIPS 2022 & TPAMI 2025] DeepInteraction: 3D Object Detection via Modality Interaction ☆245 Updated 5 months ago
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection ☆126 Updated 2 years ago
- Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection" ☆109 Updated 2 years ago
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection ☆109 Updated 2 years ago
- Official code for BEVStereo ☆276 Updated 3 years ago
- Target Inner-Geometry Learning for BEV 3D Object Detection ☆90 Updated 2 years ago
- [ECCV2022, IJCAI2022] AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection ☆156 Updated 2 years ago
- [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection ☆131 Updated 2 years ago
- ☆66 Updated 2 years ago
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection ☆183 Updated last year
- [CVPR2021] PointAugmenting: Cross-Modal Augmentation for 3D Object Detection ☆114 Updated 3 years ago
- Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV 2022 Oral) ☆113 Updated 2 years ago
- StreamPETR with 3dppe Extension ☆51 Updated last year
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving ☆230 Updated last year
- Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer ☆243 Updated 2 years ago
- Code for Paper, MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries. https://tsinghua-mars-lab.github.io/mutr3d/ ☆200 Updated 2 years ago
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection ☆241 Updated last year
- [ECCV 2024] Ray Denoising (RayDN): Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection ☆115 Updated last year
- 3D occupancy ☆392 Updated 2 years ago
- [IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation ☆151 Updated 6 months ago
- An Efficient, Flexible, and General deep learning framework that retains minimal. ☆125 Updated last year
- An official code release of our CVPR'23 paper, BEVHeight ☆222 Updated last year
- MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer (CVPR 2022) ☆148 Updated 11 months ago
- [ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning ☆199 Updated last month
- [ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction ☆147 Updated last year
- Official PyTorch implementation of FocalFormer3D [ICCV 2023] ☆198 Updated 9 months ago
- ☆71 Updated 3 years ago
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation" ☆338 Updated last year