hustvl / MIM4DLinks
[IJCV 2025] MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
☆76Updated 8 months ago
Alternatives and similar repositories for MIM4D
Users that are interested in MIM4D are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆112Updated last year
- [ECCV 2024] Occupancy as Set of Points☆92Updated last year
- ☆103Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆116Updated last year
- [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries☆196Updated last year
- [CVPR'25] LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes☆73Updated 7 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆50Updated last year
- [CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects☆110Updated 3 weeks ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆83Updated last year
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆196Updated last year
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆69Updated 2 months ago
- [ICLR 2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework☆56Updated 2 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆207Updated last month
- LightwheelOcc: A 3D Occupancy Synthetic Dataset in Autonomous Driving☆106Updated 7 months ago
- ☆139Updated 2 months ago
- [CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"☆116Updated last year
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆129Updated 11 months ago
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆96Updated last year
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆33Updated 10 months ago
- ☆30Updated last year
- Code repository of "GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation"☆58Updated last month
- Official Toolkit for The RoboDrive Challenge☆75Updated last year
- ☆127Updated last year
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"☆116Updated last year
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024)☆203Updated last year
- High-res 3D Occupancy Dataset for Unified 3D Scene Understanding.☆29Updated last year
- (ICCV2023) MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection☆85Updated 2 years ago
- project page of "RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning"☆20Updated 4 months ago
- EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection☆83Updated last year
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆78Updated 4 months ago