Tencent-Hunyuan / HunyuanVideo-Foley
HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
☆185 Updated this week
Alternatives and similar repositories for HunyuanVideo-Foley
Users who are interested in HunyuanVideo-Foley are comparing it to the repositories listed below.
- Streamlining Cartoon Production with Generative Post-Keyframing ☆384 Updated 2 weeks ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation ☆288 Updated 2 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers ☆415 Updated 2 weeks ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset ☆257 Updated 2 months ago
- Mobius: Text to Seamless Looping Video Generation via Latent Shift ☆164 Updated 3 months ago
- Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation. ☆558 Updated this week
- Pusa: Thousands Timesteps Video Diffusion Model ☆597 Updated last week
- Achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi… ☆300 Updated 2 weeks ago
- [SIGGRAPH 2025] Official code of the paper "Cobra: Efficient Line Art COlorization with BRoAder References". Cobra: efficient line art colorization using a broader range of reference images ☆209 Updated 4 months ago
- Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation ☆252 Updated last month
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation" ☆145 Updated last week
- [Official] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off ☆302 Updated 2 weeks ago
- Calligrapher: Freestyle Text Image Customization ☆280 Updated last month
- ☆217 Updated last month
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis) ☆118 Updated 2 weeks ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai… ☆247 Updated 3 weeks ago
- The official code implementation of the paper "OmniConsistency: Learning Style-Agnostic Consistency from Paired Stylization Data." ☆396 Updated 2 months ago
- ☆91 Updated 2 months ago
- SeedVR2: One-Step Video Restoration via Diffusion Adversarial Post-Training ☆365 Updated 2 months ago
- In-context subject-driven image generation while preserving foreground fidelity ☆348 Updated 2 months ago
- The best OSS video generation models ☆134 Updated 10 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution ☆356 Updated 3 weeks ago
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control… ☆381 Updated last week
- ☆279 Updated 2 months ago
- [ICCV 2025] LayerAnimate: Layer-specific Control for Animation ☆187 Updated last week
- Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aware bokeh e… ☆107 Updated last month
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait. ☆375 Updated 2 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling". ☆184 Updated 4 months ago
- ☆80 Updated 6 months ago
- All-round Creator and Editor ☆234 Updated 7 months ago