hollow-503 / UniM2AEView external linksLinks
[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆66Sep 4, 2024Updated last year
Alternatives and similar repositories for UniM2AE
Users that are interested in UniM2AE are comparing it to the libraries listed below
Sorting:
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)☆51Dec 7, 2025Updated 2 months ago
- ☆13Feb 6, 2025Updated last year
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆32Sep 28, 2024Updated last year
- [TCSVT] DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction☆102Oct 28, 2025Updated 3 months ago
- ☆50Oct 26, 2025Updated 3 months ago
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆41Nov 1, 2024Updated last year
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Mar 27, 2025Updated 10 months ago
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)☆203Jul 9, 2024Updated last year
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)☆34Aug 14, 2025Updated 6 months ago
- HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)☆185Sep 28, 2024Updated last year
- [ECCV 2024] This is the official implementation of Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object …☆14Jul 12, 2024Updated last year
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆61Jan 10, 2025Updated last year
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆82Aug 11, 2023Updated 2 years ago
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection☆74Sep 26, 2024Updated last year
- Curricular Object Manipulation in LiDAR-based Object Detection(CVPR 2023)☆40Aug 1, 2023Updated 2 years ago
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras☆34Sep 22, 2024Updated last year
- ☆22Mar 18, 2025Updated 10 months ago
- Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2…☆29Apr 20, 2025Updated 9 months ago
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆349Sep 4, 2024Updated last year
- [CVPR 2024 Highlight] Visual Point Cloud Forecasting☆348Jul 2, 2025Updated 7 months ago
- ☆60Aug 27, 2024Updated last year
- GussianPretrain for Visual Pre-training in Autonomous Driving, showcasing significant improvements across various 3D perception tasks, in…☆110Dec 4, 2025Updated 2 months ago
- A Multimodal Generative World Model for Autonomous Driving with Geometric Representations☆13Aug 27, 2025Updated 5 months ago
- ☆10Apr 8, 2024Updated last year
- [ICCV 2025] Language Driven Occupancy Prediction☆35Dec 23, 2024Updated last year
- GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving☆28Mar 21, 2025Updated 10 months ago
- [ICCV 2023] GPA-3D: Geometry-aware Prototype Alignment for Unsupervised Domain Adaptive 3D Object Detection from Point Clouds☆21Aug 11, 2023Updated 2 years ago
- [NeurIPS 2023] Query-based Temporal Fusion with Explicit Motion for 3D Object Detection☆84Jul 2, 2024Updated last year
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆46Jun 4, 2023Updated 2 years ago
- [ECCV 2024, TPAMI 2025]Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene C…☆49Dec 31, 2025Updated last month
- GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds (CVPR 2023)☆124Apr 18, 2023Updated 2 years ago
- DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)☆110Nov 24, 2023Updated 2 years ago
- [ICCV 2025] ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction☆39Dec 1, 2025Updated 2 months ago
- [ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers☆60Nov 1, 2024Updated last year
- ☆73Jul 27, 2023Updated 2 years ago
- ☆103Nov 21, 2024Updated last year
- [ECCV2024] This is the official implementation of GraphBEV, a BEV multi-modal framework for autonomous driving perception, e.g., 3D objec…☆125Aug 31, 2024Updated last year
- [ICCV 2025] GaussRender: Learning 3D Occupancy with Gaussian Rendering (official repository)☆68Jul 7, 2025Updated 7 months ago
- This method performs 3D object detection in the BEV space using images from multiple cameras.☆32Oct 26, 2022Updated 3 years ago