Audio-driven video synthesis
☆40Aug 11, 2022Updated 3 years ago
Alternatives and similar repositories for voicepuppet
Users that are interested in voicepuppet are comparing it to the libraries listed below
- Final Project for Stanford Deep Generative Modeling Class CS236.☆13Dec 14, 2019Updated 6 years ago
- Face Parsing via SegNeXt, trained on CelebAMask-HQ☆16Dec 21, 2023Updated 2 years ago
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆87Aug 17, 2020Updated 5 years ago
- ☆14Nov 25, 2025Updated 3 months ago
- ☆15Jan 11, 2024Updated 2 years ago
- An improved version of APB2Face: Real-Time Audio-Guided Multi-Face Reenactment☆84Oct 7, 2021Updated 4 years ago
- ☆208Mar 10, 2021Updated 5 years ago
- DINet-based inference service for video streams and videos☆17Nov 8, 2023Updated 2 years ago
- ☆72Jun 4, 2023Updated 2 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- Motion Retargeting Video Subjects, Modified Colab Version by stanleyshly☆13Nov 24, 2021Updated 4 years ago
- Simple local all-in-one install for IDEA2.ART☆26Jan 8, 2023Updated 3 years ago
- Some recent state-of-the-art generative models in ONE notebook: (MIX-)?(GAN|WGAN|BigGAN|MHingeGAN|AMGAN|StyleGAN|StyleGAN2)(\+ADA|\+CR|\+…☆18Oct 19, 2020Updated 5 years ago
- This repository is an official PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- A repository for generating stylized talking 3D and 3D face☆279Nov 11, 2021Updated 4 years ago
- Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalize…☆776Dec 15, 2023Updated 2 years ago
- "Analyzing and Improving the Image Quality of StyleGAN" in TensorFlow 2☆19Jan 9, 2021Updated 5 years ago
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆81Jan 3, 2024Updated 2 years ago
- AudioDVP: Photorealistic Audio-driven Video Portraits☆300Feb 27, 2024Updated 2 years ago
- FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.☆383Jun 30, 2022Updated 3 years ago
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆97May 23, 2022Updated 3 years ago
- 🇯🇵📰🗻 NHK News Web (Easy) word frequency (core list) scraper for Japanese language learners.☆15Sep 19, 2025Updated 6 months ago
- Code for the Expression Packing algorithm to be published in Eurographics 2020☆16May 27, 2020Updated 5 years ago
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- CVPR 2019☆259May 24, 2023Updated 2 years ago
- ☆28Oct 1, 2023Updated 2 years ago
- A Ruby Gem for interacting with Android API from within Termux☆18Mar 24, 2019Updated 6 years ago
- ☆34Jan 4, 2022Updated 4 years ago
- Code for paper 'Audio-Driven Emotional Video Portraits'.☆314Mar 16, 2022Updated 4 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109May 1, 2022Updated 3 years ago
- ☆14Sep 28, 2024Updated last year
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video☆76Mar 28, 2024Updated last year
- [SIGGRAPH 2024] InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars☆58Jul 22, 2024Updated last year
- Official code and dataset release for "JAFPro: Joint Appearance Fusion and Propagation for Human Video Motion Transfer from Multiple Refe…☆14Jul 5, 2021Updated 4 years ago
- Considering the original Wav2Lip was trained on LRS2 and didn't have good performance on Chinese, I preprocessed the CMLR Dataset and would t…☆63Sep 23, 2023Updated 2 years ago
- [Preprint'23] "Efficient Meshy Neural Fields for Animatable Human Avatars" https://arxiv.org/abs/2303.12965☆25Sep 30, 2024Updated last year
- Audio-Visual Generative Adversarial Network for Face Reenactment☆158Sep 11, 2025Updated 6 months ago