JeffWang987 / WorldDreamerLinks
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
☆198Updated last year
Alternatives and similar repositories for WorldDreamer
Users that are interested in WorldDreamer are comparing it to the libraries listed below
Sorting:
- ☆144Updated last year
- ☆66Updated last year
- VideoAuteur: Towards Long Narrative Video Generation☆43Updated 2 weeks ago
- ☆51Updated 10 months ago
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆112Updated 6 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆228Updated last year
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆40Updated last year
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆318Updated last year
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆151Updated 11 months ago
- The HD-VG-130M Dataset☆120Updated last year
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆100Updated 5 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆164Updated last year
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)☆183Updated last year
- ☆149Updated 10 months ago
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper☆158Updated last year
- ☆108Updated last year
- A list of works on video generation towards world model☆172Updated 3 weeks ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆108Updated last year
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆71Updated 5 months ago
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆431Updated 7 months ago
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models☆183Updated 2 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling☆183Updated last year
- ☆124Updated last year
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …☆155Updated 7 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆147Updated 2 weeks ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆92Updated 8 months ago
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"☆177Updated 3 months ago
- [ICCV 2025] MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance☆167Updated 2 weeks ago
- ☆129Updated 4 months ago