hustvl / MIM4D
MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning
☆62 · Updated 10 months ago
Alternatives and similar repositories for MIM4D:
Users who are interested in MIM4D are comparing it to the repositories listed below.
- WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation ☆93 · Updated 4 months ago
- [ECCV 2024] Occupancy as Set of Points ☆86 · Updated 6 months ago
- ☆30 · Updated 2 months ago
- [CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects ☆90 · Updated 3 months ago
- GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding ☆46 · Updated this week
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆76 · Updated last week
- ☆73 · Updated 2 weeks ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model ☆72 · Updated last month
- [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries ☆173 · Updated 6 months ago
- ☆27 · Updated 4 months ago
- OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection ☆33 · Updated 2 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model ☆60 · Updated last month
- BEVGen ☆70 · Updated 11 months ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception" ☆26 · Updated 2 months ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation ☆41 · Updated 3 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression ☆40 · Updated 4 months ago
- Official implementation for the ICCV 2023 paper "NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates… ☆36 · Updated last year
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving ☆160 · Updated 8 months ago
- [CVPR 2024] The code for "MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction" ☆97 · Updated 9 months ago
- EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection ☆74 · Updated 9 months ago
- Official Code Release of Delphi ☆54 · Updated 7 months ago
- [WACV 2025 Oral] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding ☆46 · Updated 10 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆108 · Updated last week
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training ☆47 · Updated last year
- ☆16 · Updated last year
- [ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving ☆20 · Updated 2 months ago
- GaussianAD: Gaussian-Centric End-to-End Autonomous Driving ☆64 · Updated last month
- A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios ☆48 · Updated 8 months ago
- ☆104 · Updated 6 months ago
- Project Page for GaussianFormer ☆24 · Updated 8 months ago