xiaomi-research / genesisLinks
Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
☆36Updated 2 months ago
Alternatives and similar repositories for genesis
Users that are interested in genesis are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆162Updated last month
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆83Updated 8 months ago
- [CVPR 2025] ReconDreamer☆172Updated 8 months ago
- Official Github Repo for GEM☆83Updated last week
- Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"☆205Updated 7 months ago
- ☆113Updated 7 months ago
- [CVPR 2025] DriveDreamer4D☆224Updated 5 months ago
- Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models☆198Updated last month
- Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)☆175Updated last month
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆127Updated 6 months ago
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding☆56Updated 7 months ago
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆71Updated 5 months ago
- FreeVS: Generative View Synthesis on Free Driving Trajectory☆140Updated 6 months ago
- [ICLR 2025 Spotlight] Official implementation for "DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes"☆212Updated 2 weeks ago
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"☆239Updated last year
- Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving (AAAI-25)☆50Updated 6 months ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆180Updated last year
- ☆29Updated 3 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆109Updated 6 months ago
- ☆89Updated 8 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆50Updated 7 months ago
- [AAAI 2025] DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation☆194Updated 5 months ago
- ☆20Updated 5 months ago
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆64Updated 10 months ago
- Official Code Release of Delphi☆54Updated last year
- Project Page for GaussianFormer☆24Updated last year
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models☆65Updated 2 months ago
- official code of "MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction"☆76Updated 5 months ago
- [CVPR 2025] DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation☆57Updated 3 months ago
- An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.☆282Updated 2 months ago