[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆67 · Sep 4, 2024 · Updated last year
Alternatives and similar repositories for UniM2AE
Users who are interested in UniM2AE are comparing it to the libraries listed below.
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection ☆33 · Sep 28, 2024 · Updated last year
- ☆13 · Feb 6, 2025 · Updated last year
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024) ☆204 · Jul 9, 2024 · Updated last year
- ☆55 · Oct 26, 2025 · Updated 5 months ago
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025) ☆54 · Dec 7, 2025 · Updated 3 months ago
- [TCSVT] DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction ☆103 · Mar 5, 2026 · Updated 3 weeks ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model* ☆61 · Jan 10, 2025 · Updated last year
- ☆103 · Nov 21, 2024 · Updated last year
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024) ☆34 · Aug 14, 2025 · Updated 7 months ago
- Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2… ☆30 · Apr 20, 2025 · Updated 11 months ago
- [MM2024] FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction ☆23 · Dec 6, 2024 · Updated last year
- [ECCV 2024] Ray Denoising (RayDN): Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection ☆125 · Sep 30, 2024 · Updated last year
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection ☆42 · Nov 1, 2024 · Updated last year
- GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving ☆30 · Mar 4, 2026 · Updated 3 weeks ago
- HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral) ☆187 · Sep 28, 2024 · Updated last year
- A Multimodal Generative World Model for Autonomous Driving with Geometric Representations ☆13 · Aug 27, 2025 · Updated 7 months ago
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection ☆78 · Sep 26, 2024 · Updated last year
- [ECCV 2024] This is the official implementation of Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object … ☆14 · Jul 12, 2024 · Updated last year
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I… ☆81 · Aug 11, 2023 · Updated 2 years ago
- Robust and Efficient Occupancy Prediction ☆24 · Jun 2, 2025 · Updated 9 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception" ☆33 · Mar 27, 2025 · Updated last year
- [ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers ☆60 · Nov 1, 2024 · Updated last year
- [ICCV 2025] GaussRender: Learning 3D Occupancy with Gaussian Rendering (official repository) ☆69 · Jul 7, 2025 · Updated 8 months ago
- ☆10 · Apr 8, 2024 · Updated last year
- [CVPR 2024 Highlight] Visual Point Cloud Forecasting ☆348 · Jul 2, 2025 · Updated 8 months ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training ☆46 · Jun 4, 2023 · Updated 2 years ago
- BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds - Official PyTorch implementation ☆80 · Jun 4, 2024 · Updated last year
- [ICCV 2025] Language Driven Occupancy Prediction ☆38 · Dec 23, 2024 · Updated last year
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception" ☆30 · Updated this week
- Curricular Object Manipulation in LiDAR-based Object Detection (CVPR 2023) ☆40 · Aug 1, 2023 · Updated 2 years ago
- GussianPretrain for Visual Pre-training in Autonomous Driving, showcasing significant improvements across various 3D perception tasks, in… ☆111 · Dec 4, 2025 · Updated 3 months ago
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras ☆34 · Sep 22, 2024 · Updated last year
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation" ☆353 · Sep 4, 2024 · Updated last year
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding ☆52 · Aug 28, 2023 · Updated 2 years ago
- This is the official implementation of the paper - GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Trainin… ☆76 · Jul 18, 2023 · Updated 2 years ago
- [ECCV 2024, IEEE TPAMI] Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene … ☆52 · Feb 27, 2026 · Updated last month
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection ☆196 · Dec 13, 2023 · Updated 2 years ago
- [ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos ☆457 · Mar 31, 2024 · Updated last year
- [TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving ☆416 · Dec 6, 2025 · Updated 3 months ago