[ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection
☆411Oct 7, 2023Updated 2 years ago
Alternatives and similar repositories for CMT
Users that are interested in CMT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2022 & TPAMI 2025] DeepInteraction: 3D Object Detection via Modality Interaction☆264Apr 16, 2025Updated last year
- [ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Per…☆1,057Oct 11, 2023Updated 2 years ago
- [CVPR 2023] MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection☆206May 20, 2023Updated 3 years ago
- This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion fo…☆212Aug 10, 2024Updated last year
- EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection☆326Nov 9, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆354Sep 4, 2024Updated last year
- [PyTorch] Official implementation of CVPR2022 paper "TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers". …☆760Mar 31, 2023Updated 3 years ago
- [ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection☆267Aug 16, 2024Updated last year
- Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)☆248Oct 19, 2022Updated 3 years ago
- Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, or…☆877Dec 22, 2024Updated last year
- Offical PyTorch implementation of "BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework"☆965Apr 5, 2023Updated 3 years ago
- [ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection☆808Jun 26, 2024Updated last year
- Code base of the BEVDet series .☆1,785Jul 4, 2024Updated last year
- Official code for BEVDepth.☆871Jan 18, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"☆115Aug 21, 2023Updated 2 years ago
- [ICCV 2023 & TPAMI 2026] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos☆460May 12, 2026Updated last week
- (TPAMI2024) Fully Sparse Fusion for 3D Object Detection☆113Jun 26, 2024Updated last year
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆270Mar 15, 2023Updated 3 years ago
- [ECCV2022, IJCAI2022] AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection☆154Oct 14, 2022Updated 3 years ago
- [ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation☆3,142Jul 31, 2024Updated last year
- Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]☆66Jun 4, 2024Updated last year
- [CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"☆451Sep 4, 2024Updated last year
- Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection☆348Jul 6, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Sparse4D v1 & v2☆510Jun 25, 2024Updated last year
- Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline☆808Sep 6, 2023Updated 2 years ago
- [AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers☆176Feb 6, 2023Updated 3 years ago
- Target Inner-Geometry Learning for BEV 3D Object Detection☆89May 10, 2023Updated 3 years ago
- Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)☆304Dec 9, 2022Updated 3 years ago
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆196Dec 13, 2023Updated 2 years ago
- Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)☆391Jun 1, 2023Updated 2 years ago
- [TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving☆421Dec 6, 2025Updated 5 months ago
- Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, U…☆1,107Apr 26, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MV2DFusion☆100Dec 26, 2025Updated 4 months ago
- [ICCV 2023] Official PyTorch implementation of FocalFormer3D☆207Jan 2, 2025Updated last year
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud☆39Dec 8, 2023Updated 2 years ago
- [IEEE T-PAMI 2023] Awesome BEV perception research and cookbook for all level audience in autonomous diriving☆1,376Jul 21, 2025Updated 10 months ago
- HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)☆187Sep 28, 2024Updated last year
- ☆303Oct 24, 2022Updated 3 years ago
- Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)☆1,339Oct 15, 2024Updated last year