hollow-503 / UniM2AE
[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆59 · Updated 8 months ago
Alternatives and similar repositories for UniM2AE
Users who are interested in UniM2AE are comparing it to the repositories listed below.
- [AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios ☆63 · Updated 4 months ago
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection" ☆70 · Updated 10 months ago
- ☆49 · Updated 8 months ago
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25) ☆40 · Updated 3 months ago
- (TPAMI2024) Fully Sparse Fusion for 3D Object Detection ☆98 · Updated 10 months ago
- DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation ☆73 · Updated 4 months ago
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio… ☆51 · Updated 7 months ago
- [CVPR24] COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction ☆94 · Updated last year
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection ☆48 · Updated 5 months ago
- [AAAI 2025] ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder ☆24 · Updated 5 months ago
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud ☆34 · Updated last year
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar… ☆51 · Updated 2 months ago
- [CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction" ☆106 · Updated last year
- CoIn: Contrastive Instance Feature Mining for Outdoor 3D Object Detection with Very Limited Annotations (ICCV2023) ☆36 · Updated 10 months ago
- [CVPR 2023] Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark ☆81 · Updated 2 years ago
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202… ☆55 · Updated 9 months ago
- [ECCV 2024] A Simple and Effective 3D DETR in Point Clouds ☆76 · Updated 6 months ago
- ☆49 · Updated 5 months ago
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection ☆38 · Updated 6 months ago
- [ECCV 2024] The official implementation of DualBEV ☆58 · Updated 10 months ago
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection' ☆48 · Updated 5 months ago
- [IEEE TIP 2024] Camera-based Semantic Scene Completion with Sparse Guidance Network ☆38 · Updated 7 months ago
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection ☆123 · Updated last year
- ☆61 · Updated last year
- GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds (CVPR 2023) ☆123 · Updated 2 years ago
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024) ☆186 · Updated 10 months ago
- [CVPR 2023] FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection ☆30 · Updated 9 months ago
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I… ☆71 · Updated last year
- [ECCV 2024] Occupancy as Set of Points ☆89 · Updated 10 months ago
- GaussianPretrain for Visual Pre-training in Autonomous Driving, showcasing significant improvements across various 3D perception tasks, in… ☆86 · Updated last month