valeoai / VideoActionModel
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling (official repository).
☆85 Updated 2 months ago
Alternatives and similar repositories for VideoActionModel
Users who are interested in VideoActionModel are comparing it to the repositories listed below.
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model ☆67 Updated 5 months ago
- The official repository for the ECCV2024 paper "CarFormer: Self-Driving with Learned Object-Centric Representations" ☆42 Updated 4 months ago
- [CVPR 2025, Spotlight] SimLingo (CarLLava): Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment ☆21 Updated this week
- Official Github Repo for GEM ☆52 Updated 2 weeks ago
- Codebase for the WayveScenes101 Dataset ☆176 Updated 7 months ago
- Simulator-conditioned Driving Scene Generation ☆113 Updated 3 weeks ago
- ☆41 Updated 6 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding ☆138 Updated last month
- Nexus: Decoupled Diffusion Sparks Adaptive Scene Generation ☆35 Updated 3 weeks ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model ☆79 Updated 5 months ago
- [CoRL 2023] The official code for paper "Language Conditioned Traffic Generation" ☆80 Updated 10 months ago
- The official repository of Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation ☆25 Updated 10 months ago
- GPD-1: Generative Pre-training for Driving ☆73 Updated 5 months ago
- [NeurIPS 2024] DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features ☆29 Updated 5 months ago
- Source code for NeurIPS paper "POP-3D: Open-Vocabulary 3D Occupancy Prediction from Images" ☆106 Updated 4 months ago
- ☆108 Updated 10 months ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving ☆173 Updated 11 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆123 Updated 2 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆90 Updated 3 months ago
- Generative World Explorer ☆143 Updated 5 months ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation ☆103 Updated 3 months ago
- Official code for the CVPR 2025 paper "Navigation World Models". ☆104 Updated last month
- This is the official implementation of UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving ☆42 Updated last week
- [CVPR 2025] Official Repository for Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments ☆124 Updated last month
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase ☆75 Updated last year
- ☆91 Updated 3 months ago
- ☆38 Updated last week
- ☆16 Updated last month
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A… ☆110 Updated last week
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving ☆57 Updated 6 months ago