Sense-X/GeoMIM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Sense-X/GeoMIM)

Sense-X / GeoMIM

[ICCV 2023] GeoMIM: towards better 3d knowledge transfer via masked image modeling for multi-view 3d understanding

☆52

Alternatives and similar repositories for GeoMIM

Users that are interested in GeoMIM are comparing it to the libraries listed below

Sorting:

rockywind / ADD
View on GitHub
☆11Nov 21, 2022Updated 3 years ago
mrsempress / OBMO_patchnet
View on GitHub
The OBMO module embedded in PatchNet
☆10Feb 21, 2024Updated 2 years ago
ADLab3Ds / TiG-BEV
View on GitHub
Target Inner-Geometry Learning for BEV 3D Object Detection
☆90May 10, 2023Updated 2 years ago
lucifer443 / RecurrentBEV
View on GitHub
[ECCV 2024] RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection
☆32Sep 28, 2024Updated last year
VDIGPKU / BEV-MAE
View on GitHub
[AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios
☆77Jan 7, 2025Updated last year
RunsenXu / MV-JAR
View on GitHub
[CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
☆46Jun 4, 2023Updated 2 years ago
YunzeMan / BEVGuide
View on GitHub
[CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception
☆36Jun 4, 2023Updated 2 years ago
4DVLab / CL3D
View on GitHub
This repository is for CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection.
☆27Oct 11, 2023Updated 2 years ago
zehuichen123 / BEVDistill
View on GitHub
[ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection
☆131Jun 26, 2023Updated 2 years ago
apoorv-ml / Transformers-Sensor-Fusion
View on GitHub
This repo holds trending techniques for sensor fusion task using Transformers
☆14Feb 21, 2023Updated 3 years ago
yanglei18 / Mix-Teaching
View on GitHub
This is the official implementation of our manuscript "Mix-Teaching: A Simple, Unified and Effective Semi-supervised Learning Framework …
☆42Feb 20, 2023Updated 3 years ago
qcraftai / distill-bev
View on GitHub
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
☆110Nov 24, 2023Updated 2 years ago
kcyu2014 / lidar-camera-robust-benchmark
View on GitHub
Toolkit to convert a clean LiDAR-camera dataset into a robustness benchmark
☆61May 30, 2022Updated 3 years ago
mikasa3lili / SSD-MonoDETR
View on GitHub
☆26Jan 25, 2024Updated 2 years ago
hollow-503 / UniM2AE
View on GitHub
[ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving
☆66Sep 4, 2024Updated last year
FANG-MING / occupancy-for-nuscenes
View on GitHub
3D occupancy
☆400Mar 1, 2023Updated 3 years ago
reachpranjal / lego-drive
View on GitHub
[Official] [IROS 2024] A goal-oriented planning to lift VLN performance for Closed-Loop Navigation: Simple, Yet Effective
☆28Apr 4, 2024Updated last year
chaytonmin / UniScene
View on GitHub
Official implementation of our RAL'24 paper: Multi-Camera Unified Pre-training for Autonomous Driving
☆236Feb 15, 2024Updated 2 years ago
cheukcat / The-Eyes-Have-It
View on GitHub
An intuitive approach for 3D Occupancy Detection
☆125Jun 2, 2023Updated 2 years ago
Tsinghua-MARS-Lab / GeoMAE
View on GitHub
This is the official implementation of the paper - GeoMAE: Masked Geometric Target Prediction for Self-supervised Point Cloud Pre-Trainin…
☆75Jul 18, 2023Updated 2 years ago
fjhzhixi / ECFusion
View on GitHub
code for Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object Detection
☆18Mar 4, 2024Updated last year
4DVLab / Vision-Centric-BEV-Perception
View on GitHub
Vision-Centric BEV Perception: A Survey
☆736Sep 3, 2023Updated 2 years ago
valeoai / ALSO
View on GitHub
ALSO: Automotive Lidar Self-supervision by Occupancy estimation
☆179Jul 24, 2023Updated 2 years ago
Divadi / SOLOFusion
View on GitHub
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object Detection
☆270Mar 15, 2023Updated 2 years ago
Jieqianyu / SSC-RS
View on GitHub
[IROS 2023] SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion
☆36Jun 28, 2023Updated 2 years ago
Inspur-Autodrive / Inspur_DABNet4D
View on GitHub
This method performs 3D object detection in the BEV space using images from multiple cameras.
☆32Oct 26, 2022Updated 3 years ago
weakmono3d / weakmono3d
View on GitHub
☆27Apr 12, 2023Updated 2 years ago
Cc-Hy / UniVision
View on GitHub
☆56Jan 17, 2024Updated 2 years ago
worldbench / RoboBEV
View on GitHub
[TPAMI 2025] Benchmarking and Improving Bird's Eye View Perception Robustness in Autonomous Driving
☆412Dec 6, 2025Updated 2 months ago
chaytonmin / Awesome-Papers-World-Models-Autonomous-Driving
View on GitHub
Awesome Papers about World Models in Autonomous Driving
☆87May 2, 2024Updated last year
Cc-Hy / CMKD
View on GitHub
Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection (ECCV 2022 Oral)
☆113May 10, 2023Updated 2 years ago
westlake-autolab / Delphi
View on GitHub
Official Code Release of Delphi
☆56Jun 4, 2024Updated last year
vobecant / POP3D
View on GitHub
Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"
☆118Jan 7, 2025Updated last year
sanmin0312 / LabelDistill
View on GitHub
[ECCV 2024] LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection
☆41Nov 1, 2024Updated last year
cskkxjk / Vampire
View on GitHub
(AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving
☆21Dec 20, 2023Updated 2 years ago
megvii-research / OccDepth
View on GitHub
Maybe the first academic open work on stereo 3D SSC method with vision-only input.
☆314Apr 11, 2023Updated 2 years ago
astra-vision / PaSCo
View on GitHub
[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness…
☆225May 31, 2025Updated 9 months ago
JunchengYan / GroundedSAM_OccNeRF
View on GitHub
☆29Dec 15, 2023Updated 2 years ago
ziqipang / ADDP
View on GitHub
[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
☆14Jul 4, 2025Updated 7 months ago