knightnemo / Awesome-World-Models
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
☆339Updated this week
Alternatives and similar repositories for Awesome-World-Models
Users who are interested in Awesome-World-Models are comparing it to the repositories listed below
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆90Updated 7 months ago
- ☆149Updated 9 months ago
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆253Updated last week
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆168Updated 4 months ago
- Benchmarking physical understanding in generative video models☆210Updated last month
- [NeurIPS 2025] Source code for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆86Updated this week
- 📖 This is a repository for organizing papers, code, and other resources related to Visual Reinforcement Learning.☆312Updated last week
- Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆157Updated 3 weeks ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆191Updated 5 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆73Updated 5 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆369Updated 4 months ago
- Generative World Explorer☆158Updated 4 months ago
- Visual Planning: Let's Think Only with Images☆280Updated 5 months ago
- Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆647Updated this week
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆170Updated 3 weeks ago
- Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.☆34Updated last week
- This repo contains the code for the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos"☆191Updated 8 months ago
- Code release for paper "Test-Time Training Done Right"☆301Updated last month
- Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)☆184Updated 3 months ago
- An open-source lightweight game generation paradigm. It includes everything from data processing to model architecture design and playabi…☆114Updated 9 months ago
- Official implementation for BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation☆87Updated 3 months ago
- NEO Series: Native Vision-Language Models from First Principles☆215Updated last week
- Virtual Community: An Open World for Humans, Robots, and Society☆176Updated last week
- This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning mo…☆79Updated last week
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆334Updated last week
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆377Updated 2 weeks ago
- ☆97Updated last month
- Native Multimodal Models are World Learners☆772Updated this week
- A Video Tokenizer Evaluation Dataset☆136Updated 9 months ago
- PhysX: Physical-Grounded 3D Asset Generation (NeurIPS 2025, Spotlight)☆288Updated last month