A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
☆2,091Feb 23, 2026Updated 2 weeks ago
Alternatives and similar repositories for Awesome-World-Models
Users that are interested in Awesome-World-Models are comparing it to the libraries listed below
Sorting:
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A…☆1,290Updated this week
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆340Feb 21, 2026Updated 2 weeks ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆808Jun 9, 2025Updated 9 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆578Oct 26, 2025Updated 4 months ago
- Official implementation of Continuous 3D Perception Model with Persistent State☆1,353Aug 27, 2025Updated 6 months ago
- Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.☆1,861Feb 1, 2026Updated last month
- 🌟A curated list of DUSt3R-related papers and resources, tracking recent advancements using this geometric foundation model.☆788Nov 5, 2025Updated 4 months ago
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆474Mar 22, 2025Updated 11 months ago
- Code implementation of the paper "World-in-World: World Models in a Closed-Loop World" (ICLR'26 Oral)☆139Feb 15, 2026Updated 3 weeks ago
- ☆281Feb 3, 2026Updated last month
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"☆1,337Jun 16, 2025Updated 8 months ago
- ViPE: Video Pose Engine for Geometric 3D Perception☆1,749Jan 1, 2026Updated 2 months ago
- A list of works on video generation towards world model☆387Feb 11, 2026Updated 3 weeks ago
- Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals☆1,033Updated this week
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"☆415Nov 24, 2025Updated 3 months ago
- Native Multimodal Models are World Learners☆1,464Dec 30, 2025Updated 2 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆437Feb 25, 2026Updated last week
- [ICLR 2026] Streaming 4D Visual Geometry Transformer☆838Oct 27, 2025Updated 4 months ago
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"☆505Aug 4, 2025Updated 7 months ago
- 🌐 3D and 4D World Modeling: A Survey☆841Feb 21, 2026Updated 2 weeks ago
- A curated list of awesome 3D scene generation papers. (arXiv 2505.05474)☆927Jan 17, 2026Updated last month
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆635Jul 1, 2025Updated 8 months ago
- Cameras as Relative Positional Encoding☆683Dec 18, 2025Updated 2 months ago
- Collection of forcing related autoregressive video Gen☆86Feb 27, 2026Updated last week
- A paper list for spatial reasoning☆671Jan 19, 2026Updated last month
- Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"☆1,240Jan 5, 2026Updated 2 months ago
- [TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis☆1,518Dec 13, 2025Updated 2 months ago
- Stereo4D dataset and processing code☆292Nov 4, 2025Updated 4 months ago
- Open-source unified multimodal model☆5,723Oct 27, 2025Updated 4 months ago
- [NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"☆404Sep 19, 2025Updated 5 months ago
- Depth Anything 3☆4,550Dec 12, 2025Updated 2 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,756Nov 28, 2025Updated 3 months ago
- Code release for https://kovenyu.com/WonderWorld/☆714Apr 14, 2025Updated 10 months ago
- [ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning☆1,456Jun 26, 2025Updated 8 months ago
- Awesome Unified Multimodal Models☆1,134Feb 6, 2026Updated last month
- [ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning☆1,673Feb 27, 2026Updated last week
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆1,513Updated this week
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,244Aug 7, 2025Updated 7 months ago
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision☆2,328Nov 2, 2025Updated 4 months ago