hollow-503 / UniM2AE
[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆53Updated 5 months ago
Alternatives and similar repositories for UniM2AE:
Users that are interested in UniM2AE are comparing it to the libraries listed below
- [AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios☆57Updated last month
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio…☆47Updated 5 months ago
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆37Updated 4 months ago
- (TPAMI2024) Fully Sparse Fusion for 3D Object Detection☆92Updated 8 months ago
- Commonsense Prototype for Outdoor Unsupervised 3D Object Detection (CVPR 2024)☆61Updated 5 months ago
- Official Toolkit for The RoboDrive Challenge☆74Updated 9 months ago
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202…☆51Updated 6 months ago
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection"☆69Updated 7 months ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆42Updated 4 months ago
- [AAAI 2025] ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder☆21Updated 2 months ago
- [CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"☆101Updated 10 months ago
- [CVPR 2023] FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection☆30Updated 6 months ago
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆66Updated last year
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection☆41Updated 3 months ago
- DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation☆57Updated 2 months ago
- [IROS 2024]InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction☆27Updated 8 months ago
- ☆22Updated last year
- Repo of "MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds".☆39Updated last year
- Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenarios"☆36Updated 3 months ago
- (AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving☆20Updated last year
- [IEEE TIP 2024] Camera-based Semantic Scene Completion with Sparse Guidance Network☆37Updated 5 months ago
- [CVPR24] COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction☆80Updated 10 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆40Updated 5 months ago
- ☆55Updated last year
- The official repo for [AAAI 2024] "SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection""☆32Updated 9 months ago
- ☆43Updated 6 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆27Updated 3 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆46Updated last year
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆22Updated 5 months ago
- [ECCV2022] Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection☆24Updated 2 years ago