valeoai / VideoActionModel
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).
☆96 Updated last month
Alternatives and similar repositories for VideoActionModel
Users who are interested in VideoActionModel are comparing it to the repositories listed below.
- Codebase for the WayveScenes101 Dataset ☆185 Updated 10 months ago
- The official repository for the ECCV2024 paper "CarFormer: Self-Driving with Learned Object-Centric Representations" ☆48 Updated 7 months ago
- Simulator-conditioned Driving Scene Generation ☆120 Updated 3 months ago
- Official Github Repo for GEM ☆80 Updated last month
- Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT" ☆201 Updated 6 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model ☆71 Updated 8 months ago
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation ☆100 Updated 2 weeks ago
- ReSim: Reliable World Simulation for Autonomous Driving ☆105 Updated last month
- Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models ☆174 Updated last week
- [ICLR 2025 Spotlight] MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility ☆198 Updated last month
- Official code for the CVPR 2025 paper "Navigation World Models". ☆338 Updated last week
- [CVPR 2025] Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers ☆34 Updated last month
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model ☆83 Updated 8 months ago
- ☆108 Updated 6 months ago
- ☆43 Updated last month
- A comprehensive and critical synthesis of the emerging role of GenAI across the full autonomous driving stack ☆117 Updated 2 months ago
- [ICCV 2025] ETA: Efficiency through Thinking Ahead, A Dual Approach to Self-Driving with Large Models ☆30 Updated last month
- Awesome Papers about World Models in Autonomous Driving ☆85 Updated last year
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase ☆75 Updated last year
- [ICCV 2025] Detect Anything 3D in the Wild ☆163 Updated last month
- [CVPR 2025] Official Repository for Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments ☆181 Updated 2 weeks ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆126 Updated 5 months ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving ☆178 Updated last year
- [ICCV 2025] Nexus: Decoupled Diffusion Sparks Adaptive Scene Generation ☆85 Updated last month
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆100 Updated 6 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding ☆156 Updated last month
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆111 Updated 2 months ago
- [CoRL 2024] Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving ☆63 Updated 9 months ago
- [ICLR 2025 Spotlight] Official implementation for "DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes" ☆210 Updated last month
- [ArXiv 2025] CaRL: Learning Scalable Planning Policies with Simple Rewards ☆48 Updated 3 weeks ago