hollow-503 / UniM2AE
[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆40 Updated 2 months ago
Related projects
Alternatives and complementary repositories for UniM2AE
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection" ☆65 Updated 4 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression ☆37 Updated 2 months ago
- [CVPR 2023] FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection ☆29 Updated 3 months ago
- Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 202… ☆47 Updated 3 months ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation ☆39 Updated 3 weeks ago
- Implementation of "PG-RCNN: Semantic Surface Point Generation for 3D Object Detection" (ICCV 2023) ☆30 Updated 8 months ago
- (AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving ☆18 Updated 11 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception" ☆21 Updated 4 months ago
- Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023] ☆58 Updated 10 months ago
- Code release for the ECCV 2024 paper 'Fully Test-Time Adaptation for Monocular 3D Object Detection' ☆29 Updated last month
- [ICCV 2023] GPA-3D: Geometry-aware Prototype Alignment for Unsupervised Domain Adaptive 3D Object Detection from Point Clouds ☆20 Updated last year
- [AAAI24] This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving ☆33 Updated 7 months ago
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection ☆28 Updated 3 weeks ago
- [AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios ☆45 Updated 3 months ago
- [ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding ☆45 Updated last year
- [CVPR24] COTR: Compact Occupancy TRansformer for Vision-based 3D Occupancy Prediction ☆59 Updated 7 months ago
- Fully Sparse Fusion for 3D Object Detection ☆86 Updated 4 months ago
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023 ☆72 Updated last year
- [ECCV 2024] OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection ☆57 Updated last month
- [NeurIPS 2024] OPUS: Occupancy Prediction Using a Sparse Set ☆67 Updated last month
- [CVPR2024] PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection ☆57 Updated 4 months ago
- [ICCV 2023] Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling ☆46 Updated last year
- Official Toolkit for The RoboDrive Challenge ☆73 Updated 5 months ago
- Official PyTorch Implementation of HTCL (ECCV 2024): Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion ☆34 Updated 4 months ago
- Commonsense Prototype for Outdoor Unsupervised 3D Object Detection (CVPR 2024) ☆56 Updated 2 months ago
- [IROS 2024] InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction ☆24 Updated 4 months ago
- [CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction" ☆88 Updated 7 months ago
- ☆42 Updated 3 months ago
- [ECCV 2024] Towards Stable 3D Object Detection ☆42 Updated 4 months ago
- Repo of "MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds". ☆37 Updated last year