A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, and Enthusiasts.
☆533May 15, 2026Updated last week
Alternatives and similar repositories for Awesome-Video-World-Models-with-AR-Diffusion
Users that are interested in Awesome-Video-World-Models-with-AR-Diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluation codes and data for GenEval2☆71Jan 8, 2026Updated 4 months ago
- Minute-long video generation at 24FPS.☆67Mar 28, 2026Updated last month
- [ CVPR 2026 ] MoLingo: Motion-Language Alignment for Text-to-Motion Generation☆61Apr 20, 2026Updated last month
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,339Aug 7, 2025Updated 9 months ago
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆42Nov 19, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactiv…☆701Updated this week
- [ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time☆396Oct 31, 2025Updated 6 months ago
- Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"☆205Dec 29, 2025Updated 4 months ago
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A…☆1,673Updated this week
- Official PyTorch implementation for "Effective and Efficient Masked Image Generation Models"☆33Apr 8, 2025Updated last year
- ☆36Jun 7, 2024Updated last year
- LongLive 2.0: Infra - Long Video Gen☆1,647Updated this week
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆58Feb 10, 2026Updated 3 months ago
- [NeurIPS 2024] Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features☆25Mar 20, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation☆49Apr 14, 2025Updated last year
- The official implementations of Intention-conditioned Flow Occupancy Models (InFOM)☆35May 11, 2026Updated 2 weeks ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory☆430Jul 25, 2025Updated 10 months ago
- ☆17Sep 10, 2021Updated 4 years ago
- [ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos☆517Feb 11, 2026Updated 3 months ago
- ☆46Feb 20, 2026Updated 3 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆817Jun 9, 2025Updated 11 months ago
- Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models☆140Mar 24, 2026Updated 2 months ago
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation☆132Apr 28, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts…☆2,846May 15, 2026Updated last week
- [CVPR 2026] Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video☆116Mar 24, 2026Updated 2 months ago
- Toy-scale unified multimodal model experiments — encoder-free understanding & generation with Mixture-of-Transformers on MLX/Apple Silico…☆42Mar 8, 2026Updated 2 months ago
- This is a project on visual spatial reasoning tasks-SIBench☆26Jan 12, 2026Updated 4 months ago
- A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models☆113May 18, 2026Updated last week
- [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control☆1,347Sep 24, 2025Updated 8 months ago
- ☆22Apr 17, 2024Updated 2 years ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image☆22Sep 15, 2025Updated 8 months ago
- A unified inference and post-training framework for accelerated video generation.☆3,504Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Advancing Open-source World Models☆3,764May 10, 2026Updated 2 weeks ago
- [ICML 2025] Official Code for "ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization"☆42Feb 13, 2026Updated 3 months ago
- [ CVPR 2025 ] We introduce LT3SD, a novel latent 3D scene diffusion approach enabling high-fidelity generation of infinite 3D environment…☆193Oct 28, 2025Updated 6 months ago
- ☆139Dec 19, 2025Updated 5 months ago
- Distilling Neural Fields for Real-Time Articulated Shape Reconstruction (CVPR'23)☆20Jul 11, 2023Updated 2 years ago
- Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)☆3,349Sep 12, 2025Updated 8 months ago
- HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency☆1,503Apr 15, 2026Updated last month