本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
☆2,004Jun 4, 2023Updated 2 years ago
Alternatives and similar repositories for SadTalker-Video-Lip-Sync
Users that are interested in SadTalker-Video-Lip-Sync are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆13,826Jun 26, 2024Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,247Aug 5, 2024Updated last year
- High quality Lip sync☆1,158Jul 30, 2024Updated last year
- The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."☆1,114Sep 25, 2023Updated 2 years ago
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,666Oct 18, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis☆1,251Mar 14, 2025Updated last year
- Wav2Lip version 288 and pipeline to train☆647Aug 13, 2025Updated 9 months ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆12,997Jun 22, 2025Updated 11 months ago
- Wav2Lip UHQ extension for Automatic1111☆1,420Jun 14, 2024Updated last year
- Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition☆929Apr 4, 2024Updated 2 years ago
- CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors☆739Jan 6, 2024Updated 2 years ago
- [CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"☆1,629Sep 18, 2025Updated 8 months ago
- GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code☆1,813Oct 18, 2024Updated last year
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆5,765Sep 26, 2025Updated 7 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code☆1,092Oct 18, 2024Updated last year
- Colab for making Wav2Lip high quality and easy to use☆854May 17, 2024Updated 2 years ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆1,792Jan 15, 2024Updated 2 years ago
- [CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing☆455Feb 27, 2024Updated 2 years ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆5,019Jul 2, 2024Updated last year
- http://www.facegood.ai☆1,908Mar 11, 2026Updated 2 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,844Jun 28, 2024Updated last year
- High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN☆507Mar 27, 2024Updated 2 years ago
- ☆200Jun 30, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆429Nov 1, 2023Updated 2 years ago
- Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆326Aug 8, 2023Updated 2 years ago
- ☆527Dec 26, 2023Updated 2 years ago
- This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".☆1,071Oct 27, 2023Updated 2 years ago
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior☆803Dec 5, 2023Updated 2 years ago
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆4,233Apr 7, 2026Updated last month
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆215Aug 8, 2023Updated 2 years ago
- [CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"☆472Jul 15, 2024Updated last year
- [ECCV 2022] StyleHEAT: A framework for high-resolution editable talking face generation☆658Mar 26, 2023Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation (SIGGRAPH Asia 2021)☆1,284Jun 19, 2023Updated 2 years ago
- [CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior☆617Sep 20, 2023Updated 2 years ago
- Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | 和喜欢的角色沉浸式对话吧:ChatGLM2+声音克隆+视频对话☆618Aug 11, 2023Updated 2 years ago
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)☆381Jan 12, 2025Updated last year
- Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LL…☆3,307Feb 10, 2026Updated 3 months ago
- fay是一个帮助数字人(2.5d、3d、移动、pc、网页)或大语言模型(openai兼容、deepseek)连通业务系统的agent框架。☆12,794Apr 27, 2026Updated 3 weeks ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆146Aug 2, 2023Updated 2 years ago