JeffWang987 / WorldDreamer
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
☆193 Updated 10 months ago
Related projects
Alternatives and complementary repositories for WorldDreamer
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing ☆91 Updated 2 weeks ago
- Adaptive Caching for Faster Video Generation with Diffusion Transformers ☆96 Updated 2 weeks ago
- ☆134 Updated 5 months ago
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation ☆206 Updated 2 weeks ago
- ☆93 Updated 4 months ago
- ☆145 Updated 2 months ago
- ☆193 Updated 4 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2 ☆250 Updated last month
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models ☆115 Updated last month
- Multimodal Models in Real World ☆404 Updated 3 weeks ago
- ☆254 Updated 3 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling ☆112 Updated last month
- Let's finetune video generation models! ☆241 Updated this week
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching ☆136 Updated this week
- Official code for "ControlAR: Controllable Image Generation with Autoregressive Models" ☆118 Updated 2 weeks ago
- The HD-VG-130M Dataset ☆109 Updated 7 months ago
- ☆349 Updated last month
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions ☆126 Updated 9 months ago
- 🔥 [CVPR 2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models" (SLD) ☆155 Updated 7 months ago
- ☆156 Updated last year
- [ArXiv 2024] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting wit… ☆92 Updated last month
- ☆54 Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆84 Updated 4 months ago
- This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams" ☆130 Updated 3 months ago
- ☆127 Updated 2 weeks ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual capability to any diffusion model without additional training) ☆127 Updated 5 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024) ☆218 Updated 4 months ago
- A Training-free Iterative Framework for Long Story Visualization ☆62 Updated this week
- ☆176 Updated 3 months ago