Sense-X / GeoMIM
[ICCV 2023] GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
☆47 · Updated last year
Alternatives and similar repositories for GeoMIM:
Users who are interested in GeoMIM are comparing it to the repositories listed below:
- Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023] ☆63 · Updated last year
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection ☆46 · Updated 3 months ago
- This is the implementation of the paper "SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection" (I… ☆68 · Updated last year
- AeDet: Azimuth-invariant Multi-view 3D Object Detection, CVPR2023 ☆74 · Updated last year
- [ACM MM2022, TIP2024] Graph-DETR Series for Multi-View 3D Object Detection ☆40 · Updated last year
- The multi-view version of MonoDETR on nuScenes dataset ☆20 · Updated 2 years ago
- Fully Sparse Transformer 3D Detector for LiDAR Point Cloud ☆29 · Updated last year
- An Efficient, Flexible, and General deep learning framework that retains minimal. ☆113 · Updated last year
- ☆76 · Updated last year
- [AAAI 2024] BEV-MAE: Bird's Eye View Masked Autoencoders for Point Cloud Pre-training in Autonomous Driving Scenarios ☆57 · Updated 2 months ago
- [CVPR 2023] Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark ☆76 · Updated 2 years ago
- [CVPR 2023] FrustumFormer: Adaptive Instance-aware Resampling for Multi-view 3D Detection ☆30 · Updated 7 months ago
- DetMatch: Two Teachers are Better Than One for Joint 2D and 3D Semi-Supervised Object Detection ☆36 · Updated last year
- MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation ☆91 · Updated last year
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering ☆20 · Updated 6 months ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training ☆47 · Updated last year
- StreamPETR with 3dppe Extension ☆50 · Updated last year
- Target Inner-Geometry Learning for BEV 3D Object Detection ☆89 · Updated last year
- ☆77 · Updated 2 years ago
- Codes for ICLR 2024: "MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection" ☆70 · Updated 8 months ago
- Toolkit to convert a clean LiDAR-camera dataset into a robustness benchmark ☆57 · Updated 2 years ago
- ☆59 · Updated last year
- Repo of "MsSVT: Mixed-scale Sparse Voxel Transformer for 3D Object Detection on Point Clouds". ☆39 · Updated last year
- [ICCV 2023] Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling ☆47 · Updated last year
- (AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving ☆20 · Updated last year
- ☆18 · Updated 2 years ago
- [IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception ☆29 · Updated last year
- [NeurIPS 2024] OPUS: Occupancy Prediction Using a Sparse Set ☆90 · Updated last month
- Implementation of SimMOD: A Simple Baseline for Multi-Camera 3D Object Detection ☆48 · Updated 2 years ago
- [ECCV2024] UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving ☆55 · Updated 6 months ago