leofan90 / Awesome-World-Models
A comprehensive list of papers on the definition of World Models and on using World Models for General Video Generation, Embodied AI, and Autonomous Driving, with links to code and related websites.
☆75 Updated last week
Alternatives and similar repositories for Awesome-World-Models:
Users who are interested in Awesome-World-Models are comparing it to the repositories listed below:
- [RSS 2024] Learning Manipulation by Predicting Interaction ☆100 Updated 6 months ago
- Awesome Papers about World Models in Autonomous Driving ☆76 Updated 10 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model ☆84 Updated last month
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving ☆55 Updated 4 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model ☆62 Updated 3 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization ☆89 Updated last month
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t… ☆83 Updated 4 months ago
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation ☆98 Updated 3 months ago
- ☆12 Updated 9 months ago
- ☆50 Updated 2 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding ☆86 Updated 3 months ago
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆76 Updated 2 weeks ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ☆111 Updated this week
- Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation ☆95 Updated 2 months ago
- Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives ☆50 Updated last week
- ☆56 Updated 6 months ago
- Latest Advances on Embodied Multimodal LLMs (or Vision-Language-Action Models). ☆104 Updated 8 months ago
- Simulator-conditioned Driving Scene Generation ☆97 Updated 3 weeks ago
- AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public safety by ens… ☆38 Updated 2 months ago
- GPD-1: Generative Pre-training for Driving ☆71 Updated 2 months ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving" ☆64 Updated 3 weeks ago
- ☆364 Updated 9 months ago
- Simulator designed to generate diverse driving scenarios. ☆41 Updated last week
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs. ☆151 Updated this week
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation` ☆88 Updated 2 months ago
- List of papers on video-centric robot learning ☆14 Updated 3 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ☆47 Updated last month