kaixinbear / CAPE
(CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
☆106Updated 2 years ago
Alternatives and similar repositories for CAPE:
Users that are interested in CAPE are comparing it to the libraries listed below
- Target Inner-Geometry Learning for BEV 3D Object Detection☆89Updated last year
- Multi-Modal 3D Object Detection by Box Matching☆53Updated last year
- Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"☆99Updated last year
- StreamPETR with 3dppe Extension☆51Updated last year
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆169Updated 2 years ago
- [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection☆123Updated last year
- "Rethinking IoU-based Optimization for Single-stage 3D Object Detection", ECCV2022 accept!☆131Updated 2 years ago
- [ECCV 2024] Ray Denoising (RayDN): Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection☆104Updated 7 months ago
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆170Updated last year
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆255Updated 2 years ago
- ☆61Updated last year
- ☆70Updated 3 years ago
- [CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation☆165Updated 8 months ago
- [ECCV 2022] Lidar Point Cloud Guided Monocular 3D Object Detection.☆79Updated 2 years ago
- MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer (CVPR 2022)☆146Updated 6 months ago
- [IROS 2024 Oral Presentation] WidthFormer: Toward Efficient Transformer-based BEV View Transformation☆143Updated last month
- ☆77Updated 2 years ago
- An Efficient, Flexible, and General deep learning framework that retains minimal.☆117Updated last year
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆71Updated last year
- ☆42Updated 2 years ago
- Code for MonoJSG: Joint Semantic and Geometric Cost Volume for Monocular 3D Object Detection☆32Updated 2 years ago
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection☆226Updated 8 months ago
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023☆74Updated last year
- [ECCV 2024] A Simple and Effective 3D DETR in Point Clouds☆76Updated 6 months ago
- [ECCV2022, IJCAI2022] AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection☆152Updated 2 years ago
- Code base for M3DeTR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers☆99Updated 8 months ago
- (TPAMI2024) Fully Sparse Fusion for 3D Object Detection☆97Updated 10 months ago
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection☆123Updated last year
- Voxel Field Fusion for 3D Object Detection (CVPR2022)☆101Updated 2 years ago
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)☆236Updated 2 years ago