junjie18/CMT

junjie18 / CMT

[ICCV 2023] Cross Modal Transformer: Towards Fast and Robust 3D Object Detection

☆415

Alternatives and similar repositories for CMT

Users that are interested in CMT are comparing it to the libraries listed below.

fudan-zvg / DeepInteraction
[NeurIPS 2022 & TPAMI 2025] DeepInteraction: 3D Object Detection via Modality Interaction
☆267 · Apr 16, 2025 · Updated last year
megvii-research / PETR
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Per…
☆1,064 · Oct 11, 2023 · Updated 2 years ago
SxJyJay / MSMDFusion
[CVPR 2023] MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection
☆207 · May 20, 2023 · Updated 3 years ago
yinjunbo / IS-Fusion
This repository contains the PyTorch implementation of the CVPR'2024 paper (Highlight), IS-Fusion: Instance-Scene Collaborative Fusion fo…
☆220 · Aug 10, 2024 · Updated last year
hht1996ok / EA-LSS
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection
☆328 · Nov 9, 2023 · Updated 2 years ago
Haiyang-W / UniTR
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
☆359 · Sep 4, 2024 · Updated last year
XuyangBai / TransFusion
[PyTorch] Official implementation of CVPR2022 paper "TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers". …
☆769 · Mar 31, 2023 · Updated 3 years ago
yichen928 / SparseFusion
[ICCV 2023] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
☆272 · Aug 16, 2024 · Updated last year
JIA-Lab-research / UVTR
Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)
☆249 · Oct 19, 2022 · Updated 3 years ago
tusen-ai / SST
Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, or…
☆883 · Dec 22, 2024 · Updated last year
ADLab-AutoDrive / BEVFusion
Official PyTorch implementation of "BEVFusion: A Simple and Robust LiDAR-Camera Fusion Framework"
☆979 · Apr 5, 2023 · Updated 3 years ago
exiawsh / StreamPETR
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
☆824 · Jun 26, 2024 · Updated 2 years ago
HuangJunJie2017 / BEVDet
Code base of the BEVDet series.
☆1,801 · Jul 4, 2024 · Updated 2 years ago
Megvii-BaseDetection / BEVDepth
Official code for BEVDepth.
☆879 · Jan 18, 2023 · Updated 3 years ago
tusen-ai / MV2D
Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"
☆115 · Aug 21, 2023 · Updated 2 years ago
MCG-NJU / SparseBEV
[ICCV 2023 & TPAMI 2026] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
☆468 · May 19, 2026 · Updated 2 months ago
BraveGroup / FullySparseFusion
(TPAMI2024) Fully Sparse Fusion for 3D Object Detection
☆112 · Jun 26, 2024 · Updated 2 years ago
zehuichen123 / AutoAlignV2
[ECCV2022, IJCAI2022] AutoAlignV2: Deformable Feature Aggregation for Dynamic Multi-Modal 3D Object Detection
☆155 · Oct 14, 2022 · Updated 3 years ago
Divadi / SOLOFusion
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
☆271 · Mar 15, 2023 · Updated 3 years ago
Eaphan / UPIDet
Unleash the Potential of Image Branch for Cross-modal 3D Object Detection [NeurIPS2023]
☆67 · Jun 4, 2024 · Updated 2 years ago
mit-han-lab / bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
☆3,226 · Jul 31, 2024 · Updated last year
Haiyang-W / DSVT
[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"
☆456 · Sep 4, 2024 · Updated last year
Tsinghua-MARS-Lab / futr3d
Code for paper: FUTR3D: a unified sensor fusion framework for 3d detection
☆351 · Jul 6, 2023 · Updated 3 years ago
linxuewu / Sparse4D
Sparse4D v1 & v2
☆517 · Jun 25, 2024 · Updated 2 years ago
Sense-GVT / Fast-BEV
Fast-BEV: A Fast and Strong Bird’s-Eye View Perception Baseline
☆817 · Sep 6, 2023 · Updated 2 years ago
fudan-zvg / PolarFormer
[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers
☆177 · Feb 6, 2023 · Updated 3 years ago
ADLab3Ds / TiG-BEV
Target Inner-Geometry Learning for BEV 3D Object Detection
☆89 · May 10, 2023 · Updated 3 years ago
TuSimple / centerformer
Implementation for CenterFormer: Center-based Transformer for 3D Object Detection (ECCV 2022)
☆301 · Dec 9, 2022 · Updated 3 years ago
JIA-Lab-research / FocalsConv
Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
☆393 · Jun 1, 2023 · Updated 3 years ago
megvii-research / Far3D
[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection
☆199 · Dec 13, 2023 · Updated 2 years ago
worldbench / RoboBEV
[TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving
☆425 · Dec 6, 2025 · Updated 7 months ago
chaytonmin / Awesome-BEV-Perception-Multi-Cameras
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, U…
☆1,114 · Apr 26, 2024 · Updated 2 years ago
Poley97 / FSTR
Fully Sparse Transformer 3D Detector for LiDAR Point Cloud
☆40 · Dec 8, 2023 · Updated 2 years ago
NVlabs / FocalFormer3D
[ICCV 2023] Official PyTorch implementation of FocalFormer3D
☆211 · Jan 2, 2025 · Updated last year
OpenDriveLab / Birds-eye-view-Perception
[IEEE T-PAMI 2023] Awesome BEV perception research and cookbook for all level audience in autonomous driving
☆1,381 · Jul 21, 2025 · Updated last year
zhanggang001 / HEDNet
HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)
☆189 · Sep 28, 2024 · Updated last year
WayneMao / PillarNeSt
The Official Implementation of PillarNeSt
☆54 · May 19, 2025 · Updated last year
tianweiy / MVP
☆302 · Oct 24, 2022 · Updated 3 years ago
nv-tlabs / lift-splat-shoot
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)
☆1,361 · Oct 15, 2024 · Updated last year