JeffWang987 / WorldDreamerLinks
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
☆201Updated 2 years ago
Alternatives and similar repositories for WorldDreamer
Users that are interested in WorldDreamer are comparing it to the libraries listed below
Sorting:
- ☆144Updated last year
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆40Updated last year
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆115Updated 9 months ago
- VideoAuteur: Towards Long Narrative Video Generation☆43Updated 2 months ago
- ☆52Updated last year
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆231Updated last year
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆150Updated last year
- ☆66Updated last year
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆171Updated last year
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆77Updated 8 months ago
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper☆164Updated last year
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆111Updated 8 months ago
- ☆114Updated last year
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆321Updated last year
- The HD-VG-130M Dataset☆120Updated last year
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)☆188Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆132Updated last year
- ☆130Updated last year
- ☆91Updated last year
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆73Updated last year
- ICML2025, I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models☆192Updated 4 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling☆187Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year
- ☆94Updated 9 months ago
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆463Updated 9 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆304Updated 10 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆49Updated 5 months ago
- [CVPR 2025] A Hierarchical Movie Level Dataset for Long Video Generation☆80Updated 10 months ago
- [CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.☆186Updated 4 months ago
- ☆141Updated 3 months ago