hustvl / GKT
Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer
☆228Updated last year
Related projects ⓘ
Alternatives and complementary repositories for GKT
- Official code for "Structured Bird’s-Eye-View Traffic Scene Understanding from Onboard Images" (ICCV 2021)☆203Updated 2 years ago
- Code for Paper, MUTR3D: A Multi-camera Tracking Framework via 3D-to-2D Queries. https://tsinghua-mars-lab.github.io/mutr3d/☆179Updated last year
- ☆202Updated 8 months ago
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆161Updated last year
- The official repository for BEVerse☆395Updated 2 years ago
- [ECCV 2024] Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction☆125Updated 2 months ago
- [ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning☆163Updated 3 months ago
- Source code of PivotNet (ICCV2023, PivotNet: Vectorized Pivot Learning for End-to-end HD Map Construction)☆101Updated 8 months ago
- Official PyTorch implementation for paper`Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection' accepted by CVPR …☆146Updated 5 months ago
- A general map auto annotation framework based on MapTR, with high flexibility in terms of spatial scale and element type☆214Updated 9 months ago
- An official code release of our CVPR'23 paper, BEVHeight☆202Updated 3 months ago
- Code for paper "MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping", ECCV 2024 (Oral)☆174Updated 2 months ago
- ☆190Updated last year
- Official code for BEVStereo☆261Updated 2 years ago
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆239Updated last year
- Implemented BEVFormer support for BEV segmentation☆104Updated last year
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆150Updated 11 months ago
- [CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation☆148Updated 3 years ago
- EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection☆239Updated last year
- [ICLR 2024] Map Learning with Lane Segment for Autonomous Driving☆267Updated 4 months ago
- [ICCV2023 Oral] LATR: 3D Lane Detection from Monocular Images with Transformer☆180Updated 2 months ago
- ☆103Updated 2 years ago
- [IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation☆131Updated 5 months ago
- ☆228Updated last year
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving☆208Updated 9 months ago
- An intuitive approach for 3D Occupancy Detection☆126Updated last year
- Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy☆218Updated last year
- Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection☆278Updated last year
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆286Updated 2 months ago
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection☆102Updated last year