hustvl / MIM4D
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
☆62Updated 11 months ago
Alternatives and similar repositories for MIM4D:
Users that are interested in MIM4D are comparing it to the libraries listed below
- ☆31Updated 3 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆100Updated last month
- [ECCV 2024] Occupancy as Set of Points☆89Updated 8 months ago
- [CVPR 2025 Almost Oral ; )] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆102Updated last week
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆85Updated last month
- [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries☆175Updated 8 months ago
- [CVPR'25] LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes☆34Updated this week
- [CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects☆92Updated 4 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆27Updated 3 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆53Updated last week
- ☆79Updated last month
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆40Updated 5 months ago
- ☆28Updated 6 months ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆76Updated 3 months ago
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection☆42Updated 3 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆61Updated 3 months ago
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆42Updated 2 months ago
- ☆104Updated 8 months ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆166Updated 9 months ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆42Updated 4 months ago
- [CVPR 2025] ReconDreamer☆104Updated 3 months ago
- ☆76Updated last year
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images"☆103Updated 2 months ago
- Project Page for GaussianFormer☆25Updated 9 months ago
- [CVPR 2025] DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation☆37Updated last week
- Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenarios"☆36Updated 4 months ago
- [RA-L] DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction☆71Updated 5 months ago
- [CVPR 2023] Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark☆76Updated 2 years ago
- [CVPR2025]UniScene: Unified Occupancy-centric Driving Scene Generation☆73Updated 2 months ago