mowshon / lipsync
lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based on provided audio, supports CPU/CUDA, and uses caching for faster processing.
☆106Updated last month
Alternatives and similar repositories for lipsync:
Users that are interested in lipsync are comparing it to the libraries listed below
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆94Updated 2 years ago
- Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".☆199Updated 11 months ago
- Full version of wav2lip-onnx including face alignment and face enhancement and more...☆86Updated last week
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆211Updated last year
- An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing …☆105Updated last year
- Faster Talking Face Animation on Xeon CPU☆124Updated last year
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)☆358Updated last month
- Audio-Visual Generative Adversarial Network for Face Reenactment☆157Updated 9 months ago
- ☆125Updated 9 months ago
- ☆24Updated 3 years ago
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆51Updated 11 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆68Updated 7 months ago
- A curated list of resources of audio-driven talking face generation☆141Updated 2 years ago
- Mocap Dataset of “Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation”☆158Updated 3 years ago
- Speech to Facial Animation using GANs☆41Updated 3 years ago
- Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆310Updated last year
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆80Updated last year
- ☆196Updated last year
- 这是一个在wav2lip,使用wav2lip、gfpgan、yolov5等模型用RT加速的超快推理!经测试在2070显卡上可达到0.03秒每帧实现实时推理。☆27Updated last year
- ☆32Updated 3 years ago
- Aim to accelerate the image-animation-model inference through the inference frameworks such as onnx、tensorrt and openvino.☆76Updated 11 months ago
- optimized wav2lip☆19Updated last year
- [ICCV 2023]ToonTalker: Cross-Domain Face Reenactment☆115Updated 3 months ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆128Updated last year
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)☆55Updated 2 years ago
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆60Updated 3 months ago
- Alternative to Flawless AI's TrueSync. Make lips in video match provided audio using the power of Wav2Lip and GFPGAN.☆117Updated 7 months ago
- Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation☆38Updated 4 months ago
- The code for the paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"☆168Updated last year
- 📖 A curated list of resources dedicated to avatar.☆58Updated 3 months ago