JeffWang987 / WorldDreamerLinks
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
☆200Updated last year
Alternatives and similar repositories for WorldDreamer
Users that are interested in WorldDreamer are comparing it to the libraries listed below
Sorting:
- ☆141Updated last year
- VideoAuteur: Towards Long Narrative Video Generation☆43Updated 7 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆40Updated last year
- ☆66Updated last year
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆400Updated 5 months ago
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆110Updated 4 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆226Updated 9 months ago
- ☆50Updated 8 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆85Updated 3 months ago
- The HD-VG-130M Dataset☆119Updated last year
- The official repository of "Sekai: A Video Dataset towards World Exploration"☆140Updated last month
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper☆156Updated last year
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆152Updated 9 months ago
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆161Updated 11 months ago
- A list of works on video generation towards world model☆164Updated 3 weeks ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆317Updated last year
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆123Updated 4 months ago
- ☆136Updated 7 months ago
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆93Updated 3 months ago
- ☆106Updated last year
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆172Updated 5 months ago
- Official respository for ReasonGen-R1☆64Updated 2 months ago
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …☆147Updated 5 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …☆67Updated 3 months ago
- [CVPR 25] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.☆35Updated 5 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆45Updated 3 weeks ago
- ☆118Updated last year
- I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models☆179Updated 6 months ago
- ☆359Updated 10 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Updated last year