[ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
☆410Oct 7, 2023Updated 2 years ago
Alternatives and similar repositories for CMT
Users that are interested in CMT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2022 & TPAMI 2025] DeepInteraction: 3D Object Detection via Modality Interaction☆259Apr 16, 2025Updated 11 months ago
- [ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Per…☆1,044Oct 11, 2023Updated 2 years ago
- [CVPR 2023] MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection☆203May 20, 2023Updated 2 years ago
- This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion fo…☆202Aug 10, 2024Updated last year
- EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection☆324Nov 9, 2023Updated 2 years ago
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆353Sep 4, 2024Updated last year
- [PyTorch] Official implementation of CVPR2022 paper "TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers". …☆746Mar 31, 2023Updated 2 years ago
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection☆260Aug 16, 2024Updated last year
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)☆249Oct 19, 2022Updated 3 years ago
- Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, or…☆874Dec 22, 2024Updated last year
- Offical PyTorch implementation of "BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework"☆950Apr 5, 2023Updated 2 years ago
- [ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection☆782Jun 26, 2024Updated last year
- Code base of the BEVDet series .☆1,755Jul 4, 2024Updated last year
- Official code for BEVDepth.☆860Jan 18, 2023Updated 3 years ago
- Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"☆114Aug 21, 2023Updated 2 years ago
- [ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos☆457Mar 31, 2024Updated last year
- (TPAMI2024) Fully Sparse Fusion for 3D Object Detection☆114Jun 26, 2024Updated last year
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆271Mar 15, 2023Updated 3 years ago
- [ECCV2022, IJCAI2022] AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection☆154Oct 14, 2022Updated 3 years ago
- Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]☆65Jun 4, 2024Updated last year
- [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation☆3,055Jul 31, 2024Updated last year
- [CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"☆452Sep 4, 2024Updated last year
- Sparse4D v1 & v2☆501Jun 25, 2024Updated last year
- Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection☆342Jul 6, 2023Updated 2 years ago
- Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline☆786Sep 6, 2023Updated 2 years ago
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆176Feb 6, 2023Updated 3 years ago
- Target Inner-Geometry Learning for BEV 3D Object Detection☆90May 10, 2023Updated 2 years ago
- Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)☆304Dec 9, 2022Updated 3 years ago
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆195Dec 13, 2023Updated 2 years ago
- Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)☆390Jun 1, 2023Updated 2 years ago
- [TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving☆415Dec 6, 2025Updated 3 months ago
- Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, U…☆1,099Apr 26, 2024Updated last year
- MV2DFusion☆91Dec 26, 2025Updated 2 months ago
- [ICCV 2023] Official PyTorch implementation of FocalFormer3D☆207Jan 2, 2025Updated last year
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud☆39Dec 8, 2023Updated 2 years ago
- [IEEE T-PAMI 2023] Awesome BEV perception research and cookbook for all level audience in autonomous diriving☆1,361Jul 21, 2025Updated 8 months ago
- HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)☆185Sep 28, 2024Updated last year
- Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)☆1,321Oct 15, 2024Updated last year
- ☆302Oct 24, 2022Updated 3 years ago