LiAutoAD / DIVE
☆28Updated 6 months ago
Alternatives and similar repositories for DIVE:
Users that are interested in DIVE are comparing it to the libraries listed below
- Official Code Release of Delphi☆54Updated 9 months ago
- ☆13Updated this week
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆78Updated 3 months ago
- [ECCV 2024] Occupancy as Set of Points☆88Updated 8 months ago
- ☆83Updated 2 months ago
- ☆32Updated 4 months ago
- ☆12Updated last week
- Official Github Repo for GEM☆25Updated 3 months ago
- ☆15Updated last week
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆65Updated 3 months ago
- ☆42Updated 2 months ago
- [CVPR 2025] DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation☆39Updated last month
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆28Updated last month
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25)☆34Updated last month
- Official PyTorch implementation of D^2-World as the second place and innovation award of CVPR 2024 Predictive World Model Challenge.☆11Updated 3 months ago
- [CVPR 2023] Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark☆76Updated 2 years ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆28Updated 4 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆38Updated 2 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆103Updated last month
- BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence☆34Updated last week
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆88Updated 2 months ago
- ☆104Updated 8 months ago
- BEVGen☆73Updated last year
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆41Updated 6 months ago
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆33Updated last week
- Project Page for GaussianFormer☆25Updated 9 months ago
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆10Updated 9 months ago
- [ECCV 2024]Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion☆38Updated last month
- ☆78Updated 2 months ago
- [CVPR 2025] ReconDreamer☆118Updated 3 months ago