A Curated List of Awesome Video World Models with AR Diffusion: Covering Algorithms, Applications, and Infrastructure, Aimed at Serving as a Comprehensive Resource for Researchers, Practitioners, and Enthusiasts.
☆272 Mar 15, 2026 Updated this week
Alternatives and similar repositories for Awesome-Video-World-Models-with-AR-Diffusion
Users who are interested in Awesome-Video-World-Models-with-AR-Diffusion are comparing it to the repositories listed below.
- Minute-long video generation at 24FPS. ☆58 Feb 2, 2026 Updated last month
- Evaluation codes and data for GenEval2 ☆60 Jan 8, 2026 Updated 2 months ago
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries ☆35 Nov 19, 2025 Updated 4 months ago
- [CVPR 2026] MoLingo: Motion-Language Alignment for Text-to-Motion Generation ☆54 Mar 7, 2026 Updated last week
- Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Gene… ☆456 Updated this week
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models. ☆48 Feb 10, 2026 Updated last month
- ☆36 Jun 7, 2024 Updated last year
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models ☆1,258 Aug 7, 2025 Updated 7 months ago
- [NeurIPS 2024] Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features ☆24 Mar 20, 2025 Updated last year
- The official implementations of Intention-conditioned Flow Occupancy Models (InFOM) ☆31 Feb 5, 2026 Updated last month
- ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation ☆49 Apr 14, 2025 Updated 11 months ago
- ☆43 Feb 20, 2026 Updated last month
- ☆17 Sep 10, 2021 Updated 4 years ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment ☆34 Feb 24, 2026 Updated 3 weeks ago
- This is a project on visual spatial reasoning tasks-SIBench ☆25 Jan 12, 2026 Updated 2 months ago
- Official code release for ICCV2025 paper (Highlight): MoGA: 3D Generative Avatar Prior for Monocular Gaussian Avatar Reconstruction ☆47 Oct 20, 2025 Updated 5 months ago
- Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives" ☆187 Dec 29, 2025 Updated 2 months ago
- Scalable Minecraft multiplayer data collection engine ☆114 Updated this week
- Rethinking Video Generation Model for the Embodied World ☆54 Feb 12, 2026 Updated last month
- [ICML 2025] Official Code for "ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization" ☆42 Feb 13, 2026 Updated last month
- A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and A… ☆1,334 Mar 5, 2026 Updated 2 weeks ago
- Infinite-Forcing: Towards Infinite-Long Video Generation ☆139 Nov 13, 2025 Updated 4 months ago
- Distilling Neural Fields for Real-Time Articulated Shape Reconstruction (CVPR'23) ☆20 Jul 11, 2023 Updated 2 years ago
- [ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time ☆344 Oct 31, 2025 Updated 4 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites. ☆277 Mar 9, 2026 Updated last week
- [ICCV 2025] TokensGen: Harnessing Condensed Tokens for Long Video Generation ☆57 Dec 10, 2025 Updated 3 months ago
- Official Implementation of Rethinking Score Distillation as a Bridge Between Image Distributions ☆85 Mar 26, 2025 Updated 11 months ago
- ☆19 Dec 4, 2025 Updated 3 months ago
- [NeurIPS 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆128 Jul 31, 2025 Updated 7 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model ☆25 Mar 13, 2025 Updated last year
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control ☆812 Jun 9, 2025 Updated 9 months ago
- Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight) ☆3,230 Sep 12, 2025 Updated 6 months ago
- Official implementation of "EG4D: Explicit Generation of 4D Object without Score Distillation" (ICLR 2025) ☆36 Feb 14, 2025 Updated last year
- A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention ☆287 Dec 1, 2025 Updated 3 months ago
- ☆13 Dec 17, 2025 Updated 3 months ago
- Official implementation of "PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning" (ICCV'25) ☆35 Oct 31, 2025 Updated 4 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934 ☆232 Oct 28, 2025 Updated 4 months ago
- Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective". ☆29 Dec 15, 2025 Updated 3 months ago
- Estimate dense depth maps from RGB image and sparse depth maps ☆14 Oct 4, 2018 Updated 7 years ago