xiaomi-research / genesisLinks
Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
☆33Updated last month
Alternatives and similar repositories for genesis
Users that are interested in genesis are comparing it to the libraries listed below
Sorting:
- Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)☆76Updated this week
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆83Updated 7 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆154Updated last week
- Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"☆195Updated 6 months ago
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆70Updated 4 months ago
- ☆105Updated 6 months ago
- Official Github Repo for GEM☆76Updated 3 weeks ago
- [CVPR 2025] ReconDreamer☆163Updated 7 months ago
- Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models☆164Updated 2 weeks ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆178Updated last year
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆126Updated 4 months ago
- [CVPR 2025] DriveDreamer4D☆218Updated 3 months ago
- [AAAI 2025] DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation☆188Updated 3 months ago
- Large Driving Models☆232Updated 5 months ago
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"☆234Updated 11 months ago
- ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving☆111Updated last month
- An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.☆262Updated last month
- project page of "RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning"☆18Updated 5 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆70Updated 7 months ago
- Official Code Release of Delphi☆54Updated last year
- FreeVS: Generative View Synthesis on Free Driving Trajectory☆135Updated 4 months ago
- [ICCV 2025] DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation☆74Updated last week
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆98Updated 5 months ago
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25)☆45Updated 5 months ago
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆62Updated 8 months ago
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding☆55Updated 6 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆107Updated 5 months ago
- Simulator-conditioned Driving Scene Generation☆118Updated 3 months ago
- [CVPR 2025] DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation☆53Updated last month
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase☆76Updated last year