M-E-AGI-Lab/Awesome-World-Models

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/M-E-AGI-Lab/Awesome-World-Models)

M-E-AGI-Lab / Awesome-World-Models

Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.

☆96

Alternatives and similar repositories for Awesome-World-Models

Users that are interested in Awesome-World-Models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

M-E-AGI-Lab / Muddit
View on GitHub
[ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio…
☆119Apr 13, 2026Updated 3 months ago
viiika / Prism
View on GitHub
[ICML 2026] Official Implementation of Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diff…
☆22Mar 4, 2026Updated 4 months ago
Shi-qingyu / RecTok
View on GitHub
[CVPR 26] Official PyTorch Implementation of RecTok
☆23Feb 24, 2026Updated 5 months ago
viiika / HumanEdit
View on GitHub
[CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…
☆36May 8, 2025Updated last year
furiosa-ai / uncage
View on GitHub
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation
☆17Aug 12, 2025Updated 11 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
collovlabs / ViewControl
View on GitHub
[IJCAI 2024] Official implementation of the paper "Integrating View Conditions for Image Synthesis"
☆25Aug 27, 2024Updated last year
viiika / Diffusion-Conductor
View on GitHub
[AAAI 2023 Summer Symposium, Best Paper Award] Taming Diffusion Models for Music-driven Conducting Motion Generation
☆26May 9, 2024Updated 2 years ago
knightnemo / Awesome-World-Models
View on GitHub
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts…
☆3,223Updated this week
ByteDance-Seed / SAIL
View on GitHub
Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"
☆85Oct 29, 2025Updated 8 months ago
DAGroup-PKU / SpatialT2I
View on GitHub
[CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling
☆85Mar 2, 2026Updated 4 months ago
CompVis / DisMo
View on GitHub
[NeurIPS 2025] DisMo: DIsentangled Motion Representations for Open-World Motion Transfer
☆31Dec 14, 2025Updated 7 months ago
Owen718 / LongPrompt-LLamaGen
View on GitHub
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30Oct 21, 2024Updated last year
viiika / Meissonic
View on GitHub
[ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image…
☆345Jul 15, 2026Updated last week
arielshaulov / TokenTrim
View on GitHub
Official implementation of the paper "TOKENTRIM: INFERENCE-TIME TOKEN PRUNING FOR AUTOREGRESSIVE LONG VIDEO GENERATION"
☆15Feb 8, 2026Updated 5 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ziqihuangg / Awesome-From-Video-Generation-to-World-Model
View on GitHub
A list of works on video generation towards world model
☆502Mar 21, 2026Updated 4 months ago
EgoAlpha / Egocentric-Dataset
View on GitHub
☆39Mar 24, 2022Updated 4 years ago
ashawkey / camera_viewer
View on GitHub
Camera pose visualizer
☆33May 27, 2026Updated last month
Tencent-Hunyuan / iFSQ
View on GitHub
iFSQ & LlamaGen-REPA
☆102Jan 27, 2026Updated 5 months ago
mikeallen39 / FlowCache
View on GitHub
[ICLR2026] The open-source code for FlowCache, including accelerated implementations of the MAGI-1 and Skyreels-V2.
☆29Apr 24, 2026Updated 3 months ago
yuanzhi-zhu / DiMO
View on GitHub
[ICCV2025] "Di[M]O: Distilling Masked Diffusion Models into One-step Generator", Yuanzhi Zhu, Xi Wang, Stéphane Lathuilière, Vicky Kal…
☆39Aug 14, 2025Updated 11 months ago
jiaming-zhou / Zero-WAM
View on GitHub
Zero-WAM, an in-context world model for zero-shot robotic task generalization
☆33Jul 8, 2026Updated 2 weeks ago
GregxmHu / OccuBench
View on GitHub
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
☆21Apr 14, 2026Updated 3 months ago
wangf3014 / VTok
View on GitHub
Official implementation of VTok: A Unified Video Tokenizer with Decoupled Spatial-Temporal Latents
☆15Feb 5, 2026Updated 5 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Alpha-VLLM / Lumina-DiMOO
View on GitHub
Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model
☆1,003May 19, 2026Updated 2 months ago
baaivision / Emu3.5
View on GitHub
Native Multimodal Models are World Learners
☆1,538Dec 30, 2025Updated 6 months ago
world-model-eval / world-model-eval
View on GitHub
Code for "Evaluating Robot Policies in a World Model".
☆100Nov 6, 2025Updated 8 months ago
ealicesora / Awesome-Autoregressive-Video-Diffusion
View on GitHub
Collection of forcing related autoregressive video Gen
☆98Mar 31, 2026Updated 3 months ago
miccooper9 / egowm
View on GitHub
☆55Jan 26, 2026Updated 6 months ago
lucasjinreal / LLaVA-Magvit2
View on GitHub
LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.
☆38Jun 20, 2024Updated 2 years ago
leeruibin / hybrid-forcing
View on GitHub
☆32Apr 29, 2026Updated 2 months ago
leofan90 / Awesome-World-Models
View on GitHub
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A…
☆1,914Updated this week
WenjieShu / LoopViT
View on GitHub
☆46Feb 4, 2026Updated 5 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
BienLuky / Rectified-SpaAttn
View on GitHub
The official implementation of "Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video Generation"
☆22Feb 8, 2026Updated 5 months ago
X-Omni-Team / X-Omni
View on GitHub
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
☆426Aug 26, 2025Updated 11 months ago
ML-GSAI / SMDM
View on GitHub
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆384Dec 22, 2024Updated last year
PKU-YuanGroup / OSP-Next
View on GitHub
OSP-Next
☆67Jun 22, 2026Updated last month
Ephemeral182 / Empirical-Study-of-GPT-4o-Image-Gen
View on GitHub
An Empirical Study of GPT-4o Image Generation Capabilities
☆29Apr 16, 2025Updated last year
Aaron617 / text2world
View on GitHub
[ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation
☆29Feb 25, 2025Updated last year
recuriosity / recuriosity
View on GitHub
Code for the paper "Remember to be Curious: Episodic Context and Persistent Worlds for 3D Exploration"
☆57May 22, 2026Updated 2 months ago