hollow-503 / UniM2AELinks
[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆61Updated 10 months ago
Alternatives and similar repositories for UniM2AE
Users that are interested in UniM2AE are comparing it to the libraries listed below
Sorting:
- [CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"☆112Updated last year
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud☆36Updated last year
- [ICLR 2024] MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection☆73Updated last year
- [AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios☆68Updated 6 months ago
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆39Updated 8 months ago
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection☆69Updated 9 months ago
- [CVPR 2023] FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection☆30Updated 11 months ago
- [AAAI 2025] ProtoOcc: Accurate, Efficient 3D Occupancy Prediction Using Dual Branch Encoder-Prototype Query Decoder☆30Updated last week
- Commonsense Prototype for Outdoor Unsupervised 3D Object Detection (CVPR 2024)☆66Updated last week
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection'☆53Updated 7 months ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆47Updated 8 months ago
- (TPAMI2024) Fully Sparse Fusion for 3D Object Detection☆102Updated last year
- BEVContrast: Self-Supervision in BEV Space for Automotive Lidar Point Clouds - Official PyTorch implementation☆78Updated last year
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆44Updated 9 months ago
- [IEEE TIP 2024] Camera-based Semantic Scene Completion with Sparse Guidance Network☆40Updated 9 months ago
- DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction☆69Updated this week
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202…☆60Updated 11 months ago
- [AAAI24]This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving☆38Updated last year
- [IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Predictio…☆59Updated 9 months ago
- ☆50Updated 7 months ago
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception☆34Updated 2 years ago
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆77Updated last year
- [ECCV 2024] A Simple and Effective 3D DETR in Point Clouds☆81Updated 8 months ago
- [CVPR24] COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction☆100Updated last year
- CoIn: Contrastive Instance Feature Mining for Outdoor 3D Object Detection with Very Limited Annotations(ICCV2023)☆38Updated last year
- (AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving☆20Updated last year
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding☆48Updated last year
- Official Toolkit for The RoboDrive Challenge☆73Updated last year
- DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation☆81Updated 6 months ago
- [CVPR2024] PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection☆70Updated last year