[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆68Sep 4, 2024Updated last year
Alternatives and similar repositories for UniM2AE
Users that are interested in UniM2AE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆33Sep 28, 2024Updated last year
- ☆14Feb 6, 2025Updated last year
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)☆204Jul 9, 2024Updated last year
- ☆57Oct 26, 2025Updated 5 months ago
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)☆54Dec 7, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [TCSVT] DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction☆107Mar 26, 2026Updated 3 weeks ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆63Updated this week
- ☆103Nov 21, 2024Updated last year
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)☆34Aug 14, 2025Updated 8 months ago
- Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2…☆30Apr 20, 2025Updated 11 months ago
- [MM2024] FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction☆24Dec 6, 2024Updated last year
- [ECCV 2024] Ray Denoising (RayDN): Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection☆125Sep 30, 2024Updated last year
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆42Nov 1, 2024Updated last year
- GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving☆32Mar 4, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)☆186Sep 28, 2024Updated last year
- A Multimodal Generative World Model for Autonomous Driving with Geometric Representations☆13Aug 27, 2025Updated 7 months ago
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection☆78Sep 26, 2024Updated last year
- [ECCV 2024] This is the official implementation of Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object …☆14Jul 12, 2024Updated last year
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆80Aug 11, 2023Updated 2 years ago
- Robust and Efficient Occupancy Prediction☆24Jun 2, 2025Updated 10 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Mar 27, 2025Updated last year
- [ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers☆60Nov 1, 2024Updated last year
- [ICCV 2025] GaussRender: Learning 3D Occupancy with Gaussian Rendering (official repository)☆70Jul 7, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Apr 8, 2024Updated 2 years ago
- [CVPR 2024 Highlight] Visual Point Cloud Forecasting☆349Jul 2, 2025Updated 9 months ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆46Jun 4, 2023Updated 2 years ago
- BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds - Official PyTorch implementation☆80Jun 4, 2024Updated last year
- [ICCV 2025] Language Driven Occupancy Prediction☆39Dec 23, 2024Updated last year
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Mar 25, 2026Updated 3 weeks ago
- Curricular Object Manipulation in LiDAR-based Object Detection (CVPR 2023)☆40Aug 1, 2023Updated 2 years ago
- GussianPretrain for Visual Pre-training in Autonomous Driving, showcasing significant improvements across various 3D perception tasks, in…☆111Dec 4, 2025Updated 4 months ago
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras☆34Sep 22, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆354Sep 4, 2024Updated last year
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆52Aug 28, 2023Updated 2 years ago
- This is the official implementation of the paper - GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Trainin…☆76Jul 18, 2023Updated 2 years ago
- [ECCV 2024, IEEE TPAMI] Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene …☆53Feb 27, 2026Updated last month
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆196Dec 13, 2023Updated 2 years ago
- [ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos☆458Mar 31, 2024Updated 2 years ago
- [TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving☆421Dec 6, 2025Updated 4 months ago