Tencent / HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
☆7,425Updated this week
Alternatives and similar repositories for HunyuanVideo:
Users that are interested in HunyuanVideo are comparing it to the libraries listed below
- The best OSS video generation models☆2,718Updated last week
- Official repository for LTX-Video☆2,562Updated 2 weeks ago
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,701Updated 3 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆2,407Updated this week
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆3,805Updated this week
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆3,386Updated last month
- ☆1,562Updated this week
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆6,116Updated 3 months ago
- Kolors Team☆4,108Updated 2 months ago
- A general fine-tuning kit geared toward diffusion models.☆2,005Updated this week
- ☆893Updated last week
- Enjoy the magic of Diffusion models!☆6,742Updated this week
- FastVideo is a lightweight framework for accelerating large video diffusion models.☆859Updated this week
- Official inference repo for FLUX.1 models☆19,466Updated last week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,092Updated 3 months ago
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,410Updated last week
- Various AI scripts. Mostly Stable Diffusion stuff.☆3,817Updated 2 weeks ago
- tiny vision language model☆6,732Updated this week
- More relighting!☆7,348Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆1,680Updated this week
- Learning Flow Fields in Attention for Controllable Person Image Generation☆944Updated last week
- ☆1,353Updated last month
- ☆1,822Updated 2 months ago
- Taming Stable Diffusion for Lip Sync!☆1,856Updated this week
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆10,303Updated this week
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,481Updated last month
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,126Updated 5 months ago
- Latte: Latent Diffusion Transformer for Video Generation.☆1,756Updated 3 months ago
- Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆2,606Updated 2 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆8,947Updated this week