JeffWang987 / WorldDreamer
WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens
☆192Updated 8 months ago
Related projects: ⓘ
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆87Updated 5 months ago
- ☆127Updated 3 months ago
- ☆235Updated last month
- ☆143Updated 3 weeks ago
- ☆183Updated this week
- VideoTetris: Towards Compositional Text-To-Video Generation☆197Updated 2 weeks ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆199Updated 2 months ago
- ☆93Updated 2 months ago
- Code repository for T2V-Turbo☆166Updated 2 months ago
- Multimodal Models in Real World☆372Updated 2 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆125Updated 7 months ago
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)☆146Updated 5 months ago
- This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"☆105Updated last month
- "4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei☆211Updated 2 months ago
- ☆140Updated 2 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆196Updated last month
- The HD-VG-130M Dataset☆106Updated 5 months ago
- [SIGGRAPH 2024] Motion I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling☆87Updated 2 months ago
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models☆197Updated 8 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆129Updated 4 months ago
- ☆335Updated 2 weeks ago
- [CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts☆251Updated 3 months ago
- Implementation of DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing☆230Updated last year
- ☆147Updated last year
- [CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.☆129Updated 2 months ago
- Synthesizing Moving People with 3D Control☆119Updated 8 months ago
- This respository contains the code for AVID: Any-Length Video Inpainting with Diffusion Model.☆129Updated 6 months ago
- ☆168Updated 2 months ago
- An initiative to replicate Sora☆98Updated 5 months ago
- [IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance☆180Updated 6 months ago