leofan90 / Awesome-World-ModelsView external linksLinks
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related websites.
☆1,224Updated this week
Alternatives and similar repositories for Awesome-World-Models
Users that are interested in Awesome-World-Models are comparing it to the libraries listed below
Sorting:
- Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.☆1,837Feb 1, 2026Updated 2 weeks ago
- A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts…☆1,962Updated this week
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆337Dec 15, 2025Updated 2 months ago
- ICCV 2025 | TesserAct: Learning 4D Embodied World Models☆379Aug 4, 2025Updated 6 months ago
- A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (…☆2,550Updated this week
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆177Jun 20, 2025Updated 7 months ago
- [Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide☆11,884Jan 15, 2026Updated last month
- Official code for the CVPR 2025 paper "Navigation World Models".☆533Nov 24, 2025Updated 2 months ago
- [Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV,…☆477Dec 1, 2025Updated 2 months ago
- ☆228Oct 3, 2025Updated 4 months ago
- A curated list of recent diffusion models for video generation, editing, and various other applications.☆5,451Feb 3, 2026Updated last week
- Official repo and evaluation implementation of VSI-Bench☆670Aug 5, 2025Updated 6 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆279Jul 8, 2025Updated 7 months ago
- Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources☆2,115Feb 3, 2026Updated last week
- List of papers on 4D Generation.☆322Oct 10, 2024Updated last year
- [CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆143Jul 5, 2025Updated 7 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆427Jan 7, 2026Updated last month
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆473Mar 22, 2025Updated 10 months ago
- [RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion☆3,755Dec 24, 2024Updated last year
- A curated list of world models for autonomous driving.☆484Dec 23, 2025Updated last month
- ☆496Oct 30, 2025Updated 3 months ago
- A minimal implementation of DeepMind's Genie world model☆1,159Nov 22, 2025Updated 2 months ago
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …☆787Jan 28, 2026Updated 2 weeks ago
- Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction☆179Jan 14, 2026Updated last month
- [Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI☆1,907Dec 17, 2025Updated last month
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆1,251Oct 17, 2025Updated 3 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆804Jun 9, 2025Updated 8 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆572Oct 26, 2025Updated 3 months ago
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models☆835Dec 17, 2025Updated last month
- [NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats☆519Oct 14, 2025Updated 4 months ago
- [ICCV 2025] Dense Policy: Bidirectional Autoregressive Learning of Actions DSP☆72Jan 14, 2026Updated last month
- Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success☆1,037Sep 9, 2025Updated 5 months ago
- [IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems☆2,775Dec 16, 2025Updated 2 months ago
- [ICLR 2026] LongLive: Real-time Interactive Long Video Generation☆1,040Jan 27, 2026Updated 2 weeks ago
- A curated list of awesome 3D scene generation papers. (arXiv 2505.05474)☆907Jan 17, 2026Updated 3 weeks ago
- Nav-R1: Reasoning and Navigation in Embodied Scenes☆110Oct 31, 2025Updated 3 months ago
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆94Feb 8, 2026Updated last week
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆2,954Aug 28, 2025Updated 5 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆79Dec 12, 2024Updated last year