[ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding
☆52Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for GeoMIM
Users that are interested in GeoMIM are comparing it to the libraries listed below
Sorting:
- ☆11Nov 21, 2022Updated 3 years ago
- The OBMO module embedded in PatchNet☆10Feb 21, 2024Updated 2 years ago
- Target Inner-Geometry Learning for BEV 3D Object Detection☆90May 10, 2023Updated 2 years ago
- [ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection☆32Sep 28, 2024Updated last year
- [AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios☆77Jan 7, 2025Updated last year
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆46Jun 4, 2023Updated 2 years ago
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception☆36Jun 4, 2023Updated 2 years ago
- This repository is for CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection.☆27Oct 11, 2023Updated 2 years ago
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection☆131Jun 26, 2023Updated 2 years ago
- This repo holds trending techniques for sensor fusion task using Transformers☆14Feb 21, 2023Updated 3 years ago
- This is the official implementation of our manuscript "Mix-Teaching: A Simple, Unified and Effective Semi-supervised Learning Framework …☆42Feb 20, 2023Updated 3 years ago
- DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)☆110Nov 24, 2023Updated 2 years ago
- Toolkit to convert a clean LiDAR-camera dataset into a robustness benchmark☆61May 30, 2022Updated 3 years ago
- ☆26Jan 25, 2024Updated 2 years ago
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving☆66Sep 4, 2024Updated last year
- 3D occupancy☆400Mar 1, 2023Updated 3 years ago
- [Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective☆28Apr 4, 2024Updated last year
- Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving☆236Feb 15, 2024Updated 2 years ago
- An intuitive approach for 3D Occupancy Detection☆125Jun 2, 2023Updated 2 years ago
- This is the official implementation of the paper - GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Trainin…☆75Jul 18, 2023Updated 2 years ago
- code for Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection☆18Mar 4, 2024Updated last year
- Vision-Centric BEV Perception: A Survey☆736Sep 3, 2023Updated 2 years ago
- ALSO: Automotive Lidar Self-supervision by Occupancy estimation☆179Jul 24, 2023Updated 2 years ago
- Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection☆270Mar 15, 2023Updated 2 years ago
- [IROS 2023] SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion☆36Jun 28, 2023Updated 2 years ago
- This method performs 3D object detection in the BEV space using images from multiple cameras.☆32Oct 26, 2022Updated 3 years ago
- ☆27Apr 12, 2023Updated 2 years ago
- ☆56Jan 17, 2024Updated 2 years ago
- [TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving☆412Dec 6, 2025Updated 2 months ago
- Awesome Papers about World Models in Autonomous Driving☆87May 2, 2024Updated last year
- Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV 2022 Oral)☆113May 10, 2023Updated 2 years ago
- Official Code Release of Delphi☆56Jun 4, 2024Updated last year
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"☆118Jan 7, 2025Updated last year
- [ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection☆41Nov 1, 2024Updated last year
- (AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving☆21Dec 20, 2023Updated 2 years ago
- Maybe the first academic open work on stereo 3D SSC method with vision-only input.☆314Apr 11, 2023Updated 2 years ago
- [CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness…☆225May 31, 2025Updated 9 months ago
- ☆29Dec 15, 2023Updated 2 years ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 7 months ago