Wan-Video / Wan2.1
Wan: Open and Advanced Large-Scale Video Generative Models
☆9,327Updated this week
Alternatives and similar repositories for Wan2.1:
Users that are interested in Wan2.1 are comparing it to the libraries listed below
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆9,454Updated 2 weeks ago
- Video Generation Foundation Models: https://saiyan-world.github.io/goku/☆2,755Updated last month
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆3,796Updated this week
- ☆2,734Updated 2 weeks ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,213Updated this week
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆1,915Updated 3 weeks ago
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆3,816Updated last month
- Official repository for LTX-Video☆3,221Updated 3 weeks ago
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆9,282Updated this week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆16,356Updated 2 weeks ago
- Janus-Series: Unified Multimodal Understanding and Generation Models☆16,878Updated last month
- The best OSS video generation models☆3,044Updated 2 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,098Updated 3 weeks ago
- Taming Stable Diffusion for Lip Sync!☆3,317Updated last week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆2,415Updated last week
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆3,210Updated last month
- Official inference repo for FLUX.1 models☆21,048Updated last month
- ☆3,610Updated last month
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆3,382Updated last month
- Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25).☆8,692Updated this week
- ☆4,087Updated 2 weeks ago
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open☆4,636Updated this week
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…☆1,076Updated this week
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,862Updated 3 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆25,800Updated this week
- Kolors Team☆4,296Updated 4 months ago
- ☆887Updated 2 weeks ago
- ☆2,292Updated 2 weeks ago
- Enjoy the magic of Diffusion models!☆8,110Updated this week
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆4,628Updated last month