mowshon / lipsync
lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based on provided audio, supports CPU/CUDA, and uses caching for faster processing.
☆114Updated 2 months ago
Alternatives and similar repositories for lipsync:
Users that are interested in lipsync are comparing it to the libraries listed below
- Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".☆206Updated last year
- Full version of wav2lip-onnx including face alignment and face enhancement and more...☆95Updated 2 months ago
- Faster Talking Face Animation on Xeon CPU☆126Updated last year
- An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing …☆105Updated last year
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆68Updated 9 months ago
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆211Updated last year
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆96Updated 2 years ago
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆52Updated last year
- ☆124Updated 10 months ago
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆80Updated last year
- 这是一个在wav2lip,使用wav2lip、gfpgan、yolov5等模型用RT加速的超快推理!经测试在2070显卡上可达到0.03秒每帧实现实时推理。☆27Updated last year
- Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation☆38Updated 6 months ago
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆60Updated 5 months ago
- ☆199Updated last year
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)☆363Updated 3 months ago
- ☆24Updated 3 years ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆133Updated last year
- Wav2Lip UHQ Improvement with ControlNet 1.1☆73Updated last year
- Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆315Updated last year
- A curated list of resources of audio-driven talking face generation☆141Updated 2 years ago
- Audio-Visual Generative Adversarial Network for Face Reenactment☆157Updated 11 months ago
- ☆417Updated last year
- Considering the original Wav2Lip was trained on LSR2 and didn't have good performance on Chinese. I preprocessed CMLR Dataset and would t…☆60Updated last year
- Updated fork of wav2lip-hq allowing for the use of current ESRGAN models☆54Updated 11 months ago
- ☆52Updated last year
- ☆34Updated last year
- ☆32Updated 2 months ago
- 优化wav2lip的执行步骤,将头脸分离、嘴型替换、回补背景三个步骤分离,添加gfpgan强化面部功能,实现提前解帧,流式循环处理,对接obs☆69Updated 3 months ago
- ☆40Updated last year
- The code for the paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"☆169Updated 2 years ago