[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆68Sep 4, 2024Updated last year
Alternatives and similar repositories for UniM2AE
Users that are interested in UniM2AE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆33Sep 28, 2024Updated last year
- ☆14Feb 6, 2025Updated last year
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)☆204Jul 9, 2024Updated last year
- ☆57Oct 26, 2025Updated 6 months ago
- Deep Height Decoupling for Precise Vision-based 3D Occupancy Prediction (ICRA 2025)☆54Dec 7, 2025Updated 5 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [TCSVT] DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction☆107Mar 26, 2026Updated last month
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆64Apr 12, 2026Updated 3 weeks ago
- ☆103Nov 21, 2024Updated last year
- This is the implementation of the paper "FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection" (ECCV 2024)☆34Aug 14, 2025Updated 8 months ago
- Official Code Release for "Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection" in NeurIPS 2…☆30Apr 20, 2025Updated last year
- [MM2024] FusionOcc: Multi-Modal Fusion for 3D Occupancy Prediction☆24Dec 6, 2024Updated last year
- [ECCV 2024] Ray Denoising (RayDN): Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection☆125Sep 30, 2024Updated last year
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆42Nov 1, 2024Updated last year
- HEDNet (NeurIPS 2023) & SAFDNet (CVPR 2024 Oral)☆186Sep 28, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Multimodal Generative World Model for Autonomous Driving with Geometric Representations☆13Aug 27, 2025Updated 8 months ago
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection☆78Sep 26, 2024Updated last year
- GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving☆34Mar 4, 2026Updated 2 months ago
- [ECCV 2024] This is the official implementation of Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object …☆14Jul 12, 2024Updated last year
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆81Aug 11, 2023Updated 2 years ago
- Robust and Efficient Occupancy Prediction☆24Jun 2, 2025Updated 11 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Mar 27, 2025Updated last year
- [ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers☆60Nov 1, 2024Updated last year
- [ICCV 2025] GaussRender: Learning 3D Occupancy with Gaussian Rendering (official repository)☆73Jul 7, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Apr 8, 2024Updated 2 years ago
- [CVPR 2024 Highlight] Visual Point Cloud Forecasting☆348Jul 2, 2025Updated 10 months ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆45Jun 4, 2023Updated 2 years ago
- BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds - Official PyTorch implementation☆81Jun 4, 2024Updated last year
- [ICCV 2025] Language Driven Occupancy Prediction☆39Dec 23, 2024Updated last year
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Mar 25, 2026Updated last month
- Curricular Object Manipulation in LiDAR-based Object Detection(CVPR 2023)☆39Aug 1, 2023Updated 2 years ago
- GussianPretrain for Visual Pre-training in Autonomous Driving, showcasing significant improvements across various 3D perception tasks, in…☆111Dec 4, 2025Updated 5 months ago
- [ECCV 2024] SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras☆34Sep 22, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"☆354Sep 4, 2024Updated last year
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆53Aug 28, 2023Updated 2 years ago
- This is the official implementation of the paper - GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Trainin…☆75Jul 18, 2023Updated 2 years ago
- [ECCV 2024, IEEE TPAMI] Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene …☆52Feb 27, 2026Updated 2 months ago
- [AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection☆196Dec 13, 2023Updated 2 years ago
- [ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos☆459Mar 31, 2024Updated 2 years ago
- [TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving☆421Dec 6, 2025Updated 5 months ago