[ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding
☆52Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for GeoMIM
Users that are interested in GeoMIM are comparing it to the libraries listed below
Sorting:
- ☆11Nov 21, 2022Updated 3 years ago
- The OBMO module embedded in PatchNet☆10Feb 21, 2024Updated 2 years ago
- [AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios☆79Jan 7, 2025Updated last year
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆33Sep 28, 2024Updated last year
- Target Inner-Geometry Learning for BEV 3D Object Detection☆90May 10, 2023Updated 2 years ago
- DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)☆110Nov 24, 2023Updated 2 years ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆46Jun 4, 2023Updated 2 years ago
- This is the official implementation of our manuscript "Mix-Teaching: A Simple, Unified and Effective Semi-supervised Learning Framework …☆42Feb 20, 2023Updated 3 years ago
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection☆131Jun 26, 2023Updated 2 years ago
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception☆36Jun 4, 2023Updated 2 years ago
- A collection of papers about knowledge distillation in autonomous driving.☆30Mar 26, 2024Updated last year
- Toolkit to convert a clean LiDAR-camera dataset into a robustness benchmark☆61May 30, 2022Updated 3 years ago
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving☆67Sep 4, 2024Updated last year
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆270Mar 15, 2023Updated 3 years ago
- This repository is for CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection.☆27Oct 11, 2023Updated 2 years ago
- ☆26Jan 25, 2024Updated 2 years ago
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"☆120Jan 7, 2025Updated last year
- 3D occupancy☆402Mar 1, 2023Updated 3 years ago
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective☆28Apr 4, 2024Updated last year
- [TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving☆415Dec 6, 2025Updated 3 months ago
- ALSO: Automotive Lidar Self-supervision by Occupancy estimation☆180Jul 24, 2023Updated 2 years ago
- This method performs 3D object detection in the BEV space using images from multiple cameras.☆32Oct 26, 2022Updated 3 years ago
- Vision-Centric BEV Perception: A Survey☆737Sep 3, 2023Updated 2 years ago
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving☆236Feb 15, 2024Updated 2 years ago
- This is the official implementation of the paper - GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Trainin…☆76Jul 18, 2023Updated 2 years ago
- Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV 2022 Oral)☆113May 10, 2023Updated 2 years ago
- Unifying Visual Perception by Dispersible Points Learning (ECCV 2022)☆52Aug 19, 2022Updated 3 years ago
- This repo holds trending techniques for sensor fusion task using Transformers☆14Feb 21, 2023Updated 3 years ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆145Jul 2, 2023Updated 2 years ago
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆42Nov 1, 2024Updated last year
- ☆27Apr 12, 2023Updated 2 years ago
- [IROS 2023] SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion☆36Jun 28, 2023Updated 2 years ago
- Unified Architecture Search with Convolution, Transformer, and MLP (ECCV 2022)☆53Dec 20, 2022Updated 3 years ago
- [CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness…☆227May 31, 2025Updated 9 months ago
- [ECCV 2024, IEEE TPAMI] Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene …☆51Feb 27, 2026Updated 3 weeks ago
- code for Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection☆18Mar 4, 2024Updated 2 years ago
- Maybe the first academic open work on stereo 3D SSC method with vision-only input.☆318Apr 11, 2023Updated 2 years ago
- An intuitive approach for 3D Occupancy Detection☆125Jun 2, 2023Updated 2 years ago
- [ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction☆196Aug 24, 2023Updated 2 years ago