Tencent / HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
☆9,379Updated 2 weeks ago
Alternatives and similar repositories for HunyuanVideo:
Users that are interested in HunyuanVideo are comparing it to the libraries listed below
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆3,816Updated last month
- Official repository for LTX-Video☆3,189Updated 3 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆3,796Updated this week
- Wan: Open and Advanced Large-Scale Video Generative Models☆9,018Updated this week
- Various AI scripts. Mostly Stable Diffusion stuff.☆4,376Updated this week
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆1,884Updated 2 weeks ago
- The best OSS video generation models☆3,044Updated 2 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,862Updated 3 months ago
- ☆2,272Updated 2 weeks ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆2,086Updated 3 weeks ago
- ☆2,719Updated last week
- Video Generation Foundation Models: https://saiyan-world.github.io/goku/☆2,746Updated last month
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆11,019Updated this week
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,174Updated last week
- Taming Stable Diffusion for Lip Sync!☆3,317Updated this week
- Kolors Team☆4,296Updated 4 months ago
- [CVPR 2025] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,237Updated last week
- [CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text☆1,522Updated 3 months ago
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,197Updated 4 months ago
- Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25).☆8,574Updated 3 months ago
- Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding☆4,015Updated 2 months ago
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆1,268Updated this week
- Dead simple FLUX LoRA training UI with LOW VRAM support☆2,213Updated this week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,282Updated 6 months ago
- ☆4,054Updated 2 weeks ago
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open☆4,547Updated last week
- ☆1,467Updated 3 months ago
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆3,210Updated last month
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,467Updated 2 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,166Updated last month