mowshon / lipsync
lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based on provided audio, supports CPU/CUDA, and uses caching for faster processing.
☆110Updated 2 months ago
Alternatives and similar repositories for lipsync:
Users that are interested in lipsync are comparing it to the libraries listed below
- Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".☆202Updated last year
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆96Updated 2 years ago
- An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing …☆106Updated last year
- Full version of wav2lip-onnx including face alignment and face enhancement and more...☆93Updated last month
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆211Updated last year
- Faster Talking Face Animation on Xeon CPU☆125Updated last year
- The code for the paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"☆169Updated last year
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)☆361Updated 2 months ago
- A curated list of resources of audio-driven talking face generation☆141Updated 2 years ago
- ☆124Updated 10 months ago
- Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆313Updated last year
- Audio-Visual Generative Adversarial Network for Face Reenactment☆157Updated 11 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆68Updated 8 months ago
- Mocap Dataset of “Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation”☆158Updated 3 years ago
- 这是一个在wav2lip,使用wav2lip、gfpgan、yolov5等模型用RT加速的超快推理!经测试在2070显卡上可达到0.03秒每帧实现实时推理。☆27Updated last year
- ☆514Updated last year
- ☆197Updated last year
- ☆24Updated 3 years ago
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆80Updated last year
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆130Updated last year
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆60Updated 4 months ago
- ☆34Updated last year
- Using Claude Sonnet 3.5 to forward (reverse) engineer code from VASA white paper - WIP - (this is for La Raza 🎷)☆280Updated 4 months ago
- code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021☆342Updated last year
- 📖 A curated list of resources dedicated to avatar.☆58Updated 4 months ago
- ☆419Updated last year
- Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)☆356Updated 2 years ago
- The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video G…☆250Updated last year
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)☆54Updated 2 years ago
- 🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮☆203Updated 8 months ago