hustvl / MIM4DLinks
[IJCV 2025] MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
☆68Updated last month
Alternatives and similar repositories for MIM4D
Users that are interested in MIM4D are comparing it to the libraries listed below
Sorting:
- This is the official implementation of UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving☆61Updated 2 weeks ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆105Updated 5 months ago
- ☆101Updated 7 months ago
- [ECCV 2024] Occupancy as Set of Points☆90Updated last year
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆44Updated 9 months ago
- [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries☆185Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆98Updated 5 months ago
- [CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects☆99Updated 2 months ago
- [CVPR'25] LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes☆53Updated 2 weeks ago
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆81Updated 7 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆69Updated 7 months ago
- ☆105Updated 6 months ago
- ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving☆107Updated 3 weeks ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆154Updated this week
- [ICLR2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework☆39Updated 3 months ago
- ☆16Updated last year
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆178Updated last year
- [NeurIPS 2024] TALoS: Enhancing Semantic Scene Completion via Test-time Adaptation on the Line of Sight☆31Updated 3 months ago
- ☆27Updated 10 months ago
- ☆111Updated last year
- High-res 3D Occupancy Dataset for Unified 3D Scene Understanding.☆25Updated last year
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆57Updated 4 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆126Updated 4 months ago
- LightwheelOcc: A 3D Occupancy Synthetic Dataset in Autonomous Driving☆96Updated 2 weeks ago
- [ICRA 2025] Official implementation for "TrackOcc: Camera-based 4D Panoptic Occupancy Tracking"☆41Updated 3 weeks ago
- Official Code Release of Delphi☆54Updated last year
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆69Updated 3 months ago
- [CVPR2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction"☆112Updated last year
- [ECCV 2024]Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion☆41Updated 5 months ago
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆29Updated last month