fudan-zvg / PolarFormer
[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers
☆160 · Updated last year
Related projects:
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection ☆236 · Updated last year
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022) ☆227 · Updated last year
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection ☆135 · Updated 9 months ago
- Official code for BEVStereo ☆257 · Updated last year
- Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer ☆222 · Updated last year
- Code for Paper, MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries. https://tsinghua-mars-lab.github.io/mutr3d/ ☆177 · Updated last year
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving ☆202 · Updated 7 months ago
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection ☆187 · Updated last month
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection ☆101 · Updated last year
- [NeurIPS 2022] DeepInteraction: 3D Object Detection via Modality Interaction ☆217 · Updated 2 weeks ago
- [ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning ☆158 · Updated 3 weeks ago
- [IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation ☆124 · Updated 3 months ago
- Target Inner-Geometry Learning for BEV 3D Object Detection ☆87 · Updated last year
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection ☆110 · Updated last year
- Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy ☆208 · Updated last year
- [CVPR2021] PointAugmenting: Cross-Modal Augmentation for 3D Object Detection ☆109 · Updated 2 years ago
- Vision-based 3D occupancy prediction in autonomous driving: a review and outlook ☆139 · Updated 2 months ago
- [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection ☆101 · Updated last year
- [ECCV2022, IJCAI2022] AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection ☆143 · Updated last year
- An official code release of our CVPR'23 paper, BEVHeight ☆193 · Updated last month
- [ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction ☆119 · Updated last week
- 3D occupancy ☆348 · Updated last year
- Implementation of PF-Track ☆196 · Updated last year
- [CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation ☆132 · Updated last month
- https://arxiv.org/pdf/2202.02980 ☆142 · Updated last year
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation" ☆276 · Updated 2 weeks ago
- Source code of PivotNet (ICCV2023, PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction) ☆96 · Updated 5 months ago