[NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency
☆87Sep 19, 2025Updated 8 months ago
Alternatives and similar repositories for genesis
Users that are interested in genesis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My personal portfolio of robot learning algorithms☆11Mar 27, 2025Updated last year
- ☆15Aug 7, 2025Updated 9 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- [ICCV 2025] Official code of "ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation"☆640Dec 10, 2025Updated 5 months ago
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models☆123Apr 25, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆65Apr 12, 2026Updated last month
- Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)☆344Jul 22, 2025Updated 10 months ago
- [NeurIPS 2025 Spotlight] ReSim: Reliable World Simulation for Autonomous Driving☆170May 5, 2026Updated 3 weeks ago
- Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation☆70Aug 15, 2025Updated 9 months ago
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆106Nov 7, 2025Updated 6 months ago
- [ICCV 2025] Driving Scene Synthesis on Free-form Trajectories with Generative Prior☆39Jun 28, 2025Updated 10 months ago
- Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"☆244Jan 15, 2025Updated last year
- Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes☆29Mar 12, 2026Updated 2 months ago
- ☆22Mar 22, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆59Jun 8, 2025Updated 11 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆113Feb 6, 2025Updated last year
- [CVPR'25] MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models☆65May 27, 2025Updated last year
- NeurIPS2024-Papers-about-Autonomous-Driving☆19Nov 18, 2024Updated last year
- Real-Time RTUs☆12Mar 20, 2026Updated 2 months ago
- [ICLR 2026] ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving☆534Mar 13, 2026Updated 2 months ago
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆21Oct 24, 2024Updated last year
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆47Nov 21, 2025Updated 6 months ago
- ☆46Jun 3, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models☆19Jun 12, 2024Updated last year
- EGSRAL: An Enhanced 3D Gaussian Splatting based Renderer with Automated Labeling for Large-Scale Driving Scene☆25Dec 23, 2024Updated last year
- LiDAR Registration with Visual Foundation Models☆63Dec 15, 2025Updated 5 months ago
- An open source code repository of driving world models, with training, inferencing, evaluation tools, and pretrained checkpoints.☆395Jun 19, 2025Updated 11 months ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆198May 31, 2024Updated last year
- Official implementation of HEAD CoRL 2025☆26Aug 22, 2025Updated 9 months ago
- Simple tool to visualize COLMAP sparse/dense reconstruction using Rerun.☆35May 6, 2025Updated last year
- [ICCV 2025] Pytorch implementation of "VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Pr…☆55Jul 28, 2025Updated 9 months ago
- [CVPR 2026 Highlight] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding☆118Apr 17, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆71Nov 2, 2025Updated 6 months ago
- ☆20Jun 4, 2025Updated 11 months ago
- ☆13May 30, 2025Updated 11 months ago
- A Multimodal Generative World Model for Autonomous Driving with Geometric Representations☆14Aug 27, 2025Updated 9 months ago
- FreeVS: Generative View Synthesis on Free Driving Trajectory☆162Feb 22, 2025Updated last year
- [ECCV'24] Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement☆13May 6, 2025Updated last year
- Official implementation of the paper "HUGSIM: A Real-Time, Photo-Realistic and Closed-Loop Simulator for Autonomous Driving"☆384Nov 8, 2025Updated 6 months ago