Sense-X / GeoMIM
[ICCV 2023] GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
☆52Updated 2 years ago
Alternatives and similar repositories for GeoMIM
Users that are interested in GeoMIM are comparing it to the libraries listed below
- Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023]☆73Updated 2 years ago
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023☆77Updated 2 years ago
- [ACM MM2022, TIP2024] Graph-DETR Series for Multi-View 3D Object Detection☆43Updated 2 years ago
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving☆66Updated last year
- [ICLR 2024] MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection☆75Updated last year
- [CVPR 2023] FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection☆30Updated last year
- ☆82Updated 2 years ago
- The official PyTorch implementation of "Exploring Active 3D Object Detection from a Generalization Perspective" (ICLR Spotlight 2023 & TP…☆68Updated 2 years ago
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I…☆82Updated 2 years ago
- [ICCV 2023] Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling☆48Updated 2 years ago
- The official code of PolarBEV (CoRL2022).☆56Updated 3 years ago
- [NeurIPS 2025] OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection☆67Updated last year
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception☆32Updated 2 years ago
- [CVPR 2023] BEVGuide: BEV-Guided Multi-Modality Fusion for Driving Perception☆36Updated 2 years ago
- (CVPR2023) CAPE: Camera View Position Embedding for Multi-View 3D Object Detection☆111Updated 2 years ago
- Curricular Object Manipulation in LiDAR-based Object Detection (CVPR 2023)☆40Updated 2 years ago
- [CVPR 2023] Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark☆85Updated 3 years ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆46Updated 2 years ago
- ☆73Updated 2 years ago
- StreamPETR with 3dppe Extension☆51Updated 2 years ago
- [AAAI24] This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving☆40Updated last year
- Official Toolkit for The RoboDrive Challenge☆75Updated last year
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud☆37Updated 2 years ago
- Target Inner-Geometry Learning for BEV 3D Object Detection☆90Updated 2 years ago
- ☆27Updated 2 years ago
- [ICLR 2023] BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection☆131Updated 2 years ago
- [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection☆137Updated 2 years ago
- Toolkit to convert a clean LiDAR-camera dataset into a robustness benchmark☆61Updated 3 years ago
- OPUS: Occupancy Prediction Using a Sparse Set☆144Updated last month
- An Efficient, Flexible, and General deep learning framework that retains minimal.☆131Updated 2 years ago