Rudrabha / Wav2LipLinks
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
☆12,087Updated this week
Alternatives and similar repositories for Wav2Lip
Users that are interested in Wav2Lip are comparing it to the libraries listed below
Sorting:
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,086Updated 10 months ago
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆12,890Updated 11 months ago
- [CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.☆3,579Updated last year
- Wav2Lip UHQ extension for Automatic1111☆1,383Updated last year
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,608Updated 8 months ago
- 本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇…☆1,968Updated 2 years ago
- High quality Lip sync☆1,122Updated 10 months ago
- Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)☆1,262Updated 2 years ago
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆4,310Updated 2 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆22,147Updated 3 months ago
- one-click face swap☆29,953Updated 10 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆5,799Updated 10 months ago
- Industry leading face manipulation platform☆23,475Updated this week
- This repository contains the source code for the paper First Order Motion Model for Image Animation☆14,874Updated 7 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆40,844Updated 10 months ago
- Official implementation of AnimateDiff.☆11,520Updated 10 months ago
- The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."☆1,075Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆32,668Updated 2 months ago
- GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code☆1,710Updated 8 months ago
- 🔊 Text-Prompted Generative Audio Model☆38,063Updated 10 months ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆3,308Updated last year
- Inference and training library for high-quality TTS models.☆5,314Updated 6 months ago
- Bring portraits to life!☆16,277Updated last week
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆8,299Updated 3 months ago
- http://www.facegood.cc☆1,877Updated 2 years ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,635Updated 10 months ago
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆11,584Updated last week
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆8,504Updated last month
- An Open Source text-to-speech system built by inverting Whisper.☆4,288Updated 2 weeks ago
- ☆1,754Updated 2 months ago