hollow-503/UniM2AE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hollow-503/UniM2AE)

hollow-503 / UniM2AE

[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving

☆66

Alternatives and similar repositories for UniM2AE

Users that are interested in UniM2AE are comparing it to the libraries listed below

Sorting:

yanzq95 / DHD
View on GitHub
Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)
☆53Dec 7, 2025Updated 3 months ago
litwellchi / BEV-SAN
View on GitHub
☆13Feb 6, 2025Updated last year
lucifer443 / RecurrentBEV
View on GitHub
[ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection
☆32Sep 28, 2024Updated last year
AlphaPlusTT / DAOcc
View on GitHub
[TCSVT] DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction
☆101Oct 28, 2025Updated 4 months ago
boschresearch / GaussianFlowOcc
View on GitHub
☆52Oct 26, 2025Updated 4 months ago
sanmin0312 / LabelDistill
View on GitHub
[ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
☆42Nov 1, 2024Updated last year
jbji / RepVF
View on GitHub
[ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"
☆33Mar 27, 2025Updated 11 months ago
Nightmare-n / UniPAD
View on GitHub
UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)
☆203Jul 9, 2024Updated last year
CocoBoom / fsd-bev
View on GitHub
This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)
☆34Aug 14, 2025Updated 6 months ago
zhanggang001 / HEDNet
View on GitHub
HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)
☆185Sep 28, 2024Updated last year
zlichen / VectorFormer
View on GitHub
[ECCV 2024] This is the official implementation of Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object …
☆14Jul 12, 2024Updated last year
lixiaoyu2000 / HAT
View on GitHub
Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"
☆29Jan 13, 2026Updated last month
gusongen / DOME
View on GitHub
official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*
☆61Jan 10, 2025Updated last year
mengtan00 / SA-BEV
View on GitHub
This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…
☆81Aug 11, 2023Updated 2 years ago
AlmoonYsl / OPEN
View on GitHub
[ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection
☆78Sep 26, 2024Updated last year
ZZY816 / COM
View on GitHub
Curricular Object Manipulation in LiDAR-based Object Detection（CVPR 2023）
☆40Aug 1, 2023Updated 2 years ago
nullmax-vision / SimPB
View on GitHub
[ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras
☆34Sep 22, 2024Updated last year
StudyingFuFu / L2COcc
View on GitHub
☆22Mar 18, 2025Updated 11 months ago
Ghostish / ObjectCentricOccCompletion
View on GitHub
Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2…
☆29Apr 20, 2025Updated 10 months ago
OpenDriveLab / ViDAR
View on GitHub
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
☆347Jul 2, 2025Updated 8 months ago
Haiyang-W / UniTR
View on GitHub
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
☆351Sep 4, 2024Updated last year
DaTongjie / BEVSpread
View on GitHub
☆61Aug 27, 2024Updated last year
Public-BOTs / GaussianPretrain
View on GitHub
GussianPretrain for Visual Pre-training in Autonomous Driving, showcasing significant improvements across various 3D perception tasks, in…
☆110Dec 4, 2025Updated 3 months ago
fzi-forschungszentrum-informatik / muvo
View on GitHub
A Multimodal Generative World Model for Autonomous Driving with Geometric Representations
☆13Aug 27, 2025Updated 6 months ago
xiaomi-mlab / SurroundSDF
View on GitHub
☆10Apr 8, 2024Updated last year
pkqbajng / LOcc
View on GitHub
[ICCV 2025] Language Driven Occupancy Prediction
☆35Dec 23, 2024Updated last year
LiljaAdam / gasp
View on GitHub
GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving
☆29Updated this week
AlmoonYsl / QTNet
View on GitHub
[NeurIPS 2023] Query-based Temporal Fusion with Explicit Motion for 3D Object Detection
☆84Jul 2, 2024Updated last year
Liz66666 / GPA3D
View on GitHub
[ICCV 2023] GPA-3D: Geometry-aware Prototype Alignment for Unsupervised Domain Adaptive 3D Object Detection from Point Clouds
☆21Aug 11, 2023Updated 2 years ago
RunsenXu / MV-JAR
View on GitHub
[CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
☆46Jun 4, 2023Updated 2 years ago
Arlo0o / HTCL
View on GitHub
[ECCV 2024, IEEE TPAMI] Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene …
☆51Feb 27, 2026Updated last week
Nightmare-n / GD-MAE
View on GitHub
GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds (CVPR 2023)
☆124Apr 18, 2023Updated 2 years ago
qcraftai / distill-bev
View on GitHub
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
☆110Nov 24, 2023Updated 2 years ago
ViewFormerOcc / ViewFormer-Occ
View on GitHub
[ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers
☆60Nov 1, 2024Updated last year
drilistbox / 3DPPE
View on GitHub
☆72Jul 27, 2023Updated 2 years ago
EnVision-Research / SyntheOcc
View on GitHub
☆103Nov 21, 2024Updated last year
adept-thu / GraphBEV
View on GitHub
[ECCV2024] This is the official implementation of GraphBEV, a BEV multi-modal framework for autonomous driving perception, e.g., 3D objec…
☆125Aug 31, 2024Updated last year
cdb342 / ALOcc
View on GitHub
[ICCV 2025] ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction
☆41Dec 1, 2025Updated 3 months ago
Inspur-Autodrive / Inspur_DABNet4D
View on GitHub
This method performs 3D object detection in the BEV space using images from multiple cameras.
☆32Oct 26, 2022Updated 3 years ago