knightnemo / Awesome-World-ModelsView external linksLinks
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
☆1,962Updated this week
Alternatives and similar repositories for Awesome-World-Models
Users that are interested in Awesome-World-Models are comparing it to the libraries listed below
Sorting:
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A…☆1,224Updated this week
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆337Dec 15, 2025Updated 2 months ago
- Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.☆1,837Feb 1, 2026Updated 2 weeks ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆572Oct 26, 2025Updated 3 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆804Jun 9, 2025Updated 8 months ago
- ☆280Feb 3, 2026Updated last week
- Official implementation of Continuous 3D Perception Model with Persistent State☆1,336Aug 27, 2025Updated 5 months ago
- 🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.☆787Nov 5, 2025Updated 3 months ago
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"☆1,334Jun 16, 2025Updated 7 months ago
- A list of works on video generation towards world model☆343Updated this week
- ViPE: Video Pose Engine for Geometric 3D Perception☆1,717Jan 1, 2026Updated last month
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"☆408Nov 24, 2025Updated 2 months ago
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆473Mar 22, 2025Updated 10 months ago
- Native Multimodal Models are World Learners☆1,456Dec 30, 2025Updated last month
- Code to load DreamZero model checkpoints and run evaluation on DROID-sim and Genie Sim 3.0☆664Updated this week
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"☆494Aug 4, 2025Updated 6 months ago
- A paper list for spatial reasoning☆643Jan 19, 2026Updated 3 weeks ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆427Jan 7, 2026Updated last month
- 🌐 3D and 4D World Modeling: A Survey☆806Jan 17, 2026Updated 3 weeks ago
- A curated list of awesome 3D scene generation papers. (arXiv 2505.05474)☆907Jan 17, 2026Updated 3 weeks ago
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World" (ICLR'26 Oral)☆124Dec 22, 2025Updated last month
- [ICLR 2026] Streaming 4D Visual Geometry Transformer☆828Oct 27, 2025Updated 3 months ago
- [NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"☆395Sep 19, 2025Updated 4 months ago
- Open-source unified multimodal model☆5,654Oct 27, 2025Updated 3 months ago
- Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"☆1,225Jan 5, 2026Updated last month
- [ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning☆1,630Jan 28, 2026Updated 2 weeks ago
- [ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,450Jun 26, 2025Updated 7 months ago
- Depth Anything 3☆4,359Dec 12, 2025Updated 2 months ago
- Code release for https://kovenyu.com/WonderWorld/☆711Apr 14, 2025Updated 10 months ago
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,215Aug 7, 2025Updated 6 months ago
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆230Jul 25, 2025Updated 6 months ago
- Awesome Unified Multimodal Models☆1,108Feb 6, 2026Updated last week
- Cameras as Relative Positional Encoding☆677Dec 18, 2025Updated last month
- The official code of Yume☆612Jan 14, 2026Updated last month
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆497Updated this week
- Stereo4D dataset and processing code☆291Nov 4, 2025Updated 3 months ago
- Official repo for: Epipolar Geometry Improves Video Generation Models☆77Oct 28, 2025Updated 3 months ago
- Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"☆185Dec 29, 2025Updated last month
- [CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer☆12,448Oct 11, 2025Updated 4 months ago