Rudrabha / Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
☆11,851Updated 3 weeks ago
Alternatives and similar repositories for Wav2Lip:
Users that are interested in Wav2Lip are comparing it to the libraries listed below
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,028Updated 9 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,701Updated 10 months ago
- [CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.☆3,564Updated last year
- 本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇…☆1,967Updated last year
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,600Updated 6 months ago
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆4,059Updated 2 weeks ago
- High quality Lip sync☆1,112Updated 9 months ago
- http://www.facegood.cc☆1,870Updated 2 years ago
- 🔊 Text-Prompted Generative Audio Model☆37,722Updated 8 months ago
- Wav2Lip UHQ extension for Automatic1111☆1,365Updated 10 months ago
- Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)☆1,256Updated last year
- Industry leading face manipulation platform☆22,811Updated this week
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆11,349Updated last month
- Taming Stable Diffusion for Lip Sync!☆3,904Updated last week
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆21,943Updated last month
- A multi-voice TTS system trained with an emphasis on quality☆14,086Updated 5 months ago
- The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."☆1,059Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,864Updated last year
- An arbitrary face-swapping framework on images and videos with one single trained model!☆4,850Updated 9 months ago
- An unofficial PyTorch implementation of the audio LM VALL-E☆2,990Updated 2 years ago
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆3,839Updated 5 months ago
- A simple and open-source analogue of the HeyGen system☆949Updated 9 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆39,821Updated 8 months ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆3,290Updated 11 months ago
- GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code☆1,695Updated 6 months ago
- Colab for making Wav2Lip high quality and easy to use☆804Updated 11 months ago
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆3,664Updated 2 months ago
- 📖 A curated list of resources dedicated to talking face.☆1,491Updated 4 months ago
- MARS5 speech model (TTS) from CAMB.AI☆2,753Updated 9 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆5,714Updated 9 months ago