JeffWang987 / WorldDreamer
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
☆198 Updated last year
Alternatives and similar repositories for WorldDreamer
Users who are interested in WorldDreamer are comparing it to the repositories listed below.
- ☆143 Updated last year
- VideoAuteur: Towards Long Narrative Video Generation ☆43 Updated 9 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation ☆228 Updated 11 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora ☆40 Updated last year
- ☆66 Updated last year
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing ☆110 Updated 6 months ago
- ☆50 Updated 10 months ago
- ☆107 Updated last year
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆96 Updated 5 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models ☆162 Updated last year
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper ☆158 Updated last year
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort ☆151 Updated 10 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences ☆318 Updated last year
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)" ☆182 Updated last year
- The HD-VG-130M Dataset ☆120 Updated last year
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT ☆138 Updated 6 months ago
- ☆359 Updated 11 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… ☆300 Updated 7 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" … ☆70 Updated 4 months ago
- ☆143 Updated 9 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling ☆181 Updated last year
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration" ☆164 Updated 2 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition" ☆108 Updated last year
- ☆89 Updated last year
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos ☆424 Updated 6 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT ☆135 Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions ☆129 Updated last year
- A list of works on video generation towards world model ☆167 Updated 2 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆174 Updated 6 months ago
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models