knightnemo / Awesome-World-ModelsLinks
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
☆1,630Updated 2 weeks ago
Alternatives and similar repositories for Awesome-World-Models
Users that are interested in Awesome-World-Models are comparing it to the libraries listed below
Sorting:
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A…☆1,034Updated this week
- [ACM CSUR 2025] Understanding World or Predicting Future? A Comprehensive Survey of World Models☆353Updated last month
- 📖 This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.☆375Updated this week
- A paper list for spatial reasoning☆588Updated 2 weeks ago
- RynnVLA-002: A Unified Vision-Language-Action and World Model☆818Updated last month
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …☆598Updated this week
- Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆695Updated 2 months ago
- Official repo and evaluation implementation of VSI-Bench☆658Updated 5 months ago
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…☆868Updated 2 weeks ago
- A collection of paper/projects that trains flow matching model/policies via RL.☆342Updated 2 weeks ago
- Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environment…☆754Updated 2 months ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆493Updated last month
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆392Updated 4 months ago
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions☆925Updated last month
- Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective☆395Updated 6 months ago
- Cambrian-S: Towards Spatial Supersensing in Video☆455Updated last week
- Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control in…☆304Updated last week
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆419Updated last month
- A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vi…☆783Updated 3 weeks ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆190Updated 6 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆434Updated 11 months ago
- Compose multimodal datasets 🎹☆532Updated 5 months ago
- SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning☆1,173Updated 2 months ago
- ☆423Updated 2 weeks ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆587Updated 6 months ago
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.☆617Updated 6 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆308Updated 5 months ago
- StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing☆725Updated this week
- ☆350Updated 9 months ago
- Nvidia GEAR Lab's initiative to solve the robotics data problem using world models☆428Updated 2 months ago