ChenhongyiYang / WidthFormerLinks
[IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation
☆155Updated 7 months ago
Alternatives and similar repositories for WidthFormer
Users that are interested in WidthFormer are comparing it to the libraries listed below
Sorting:
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆188Updated last year
- ☆45Updated 3 months ago
- [ECCV 2024] HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras☆57Updated last year
- An official code release of our CVPR'23 paper, BEVHeight☆223Updated last year
- Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"☆112Updated 2 years ago
- StreamPETR with 3dppe Extension☆51Updated last year
- Official code for MatrixVT on BEVDepth.☆46Updated 2 years ago
- [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection☆135Updated 2 years ago
- Target Inner-Geometry Learning for BEV 3D Object Detection☆91Updated 2 years ago
- ☆57Updated last year
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆175Updated 2 years ago
- Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer☆243Updated 2 years ago
- [ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning☆202Updated 2 months ago
- ☆23Updated last year
- [ECCV 2024] Ray Denoising (RayDN): Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection☆119Updated last year
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection☆109Updated 2 years ago
- DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)☆107Updated last year
- ☆69Updated 2 years ago
- Papers on occupation, including monocular and multi-view in autonomous driving scenarios☆40Updated last year
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆265Updated 2 years ago
- Source code of PivotNet (ICCV2023, PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction)☆121Updated last year
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection☆129Updated 2 years ago
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving☆232Updated last year
- Implemented BEVFormer support for BEV segmentation☆150Updated 2 years ago
- Official PyTorch implementation for paper`Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection' accepted by CVPR …☆174Updated 4 months ago
- [ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction☆151Updated last year
- POWERBEV, a novel and elegant vision-based end-to-end framework that only consists of 2D convolutional layers to perform perception and f…☆96Updated last year
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202…☆66Updated last year
- Code for Paper, MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries. https://tsinghua-mars-lab.github.io/mutr3d/☆203Updated 2 years ago
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection☆246Updated last year