XinBow99 / Real-Time-Wav2Lip-implementationLinks
This project is a real-time Wav2Lip implementation that I am actively optimizing to enhance the precision and performance of audio-to-lip synchronization.
☆11Updated 2 years ago
Alternatives and similar repositories for Real-Time-Wav2Lip-implementation
Users that are interested in Real-Time-Wav2Lip-implementation are comparing it to the libraries listed below
Sorting:
- A batched implementation for efficient Qwen2.5-VL inference.☆20Updated 5 months ago
- wip - running some training with overfitting - https://wandb.ai/snoozie/vasa-overfitting☆309Updated last month
- The "virtual_human_stream" project is a real-time digital human system supporting audio-video dialogue. It integrates models like ernerf,…☆16Updated last year
- [CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior☆607Updated 2 years ago
- Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video☆488Updated 2 years ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆144Updated 2 years ago
- Faster Talking Face Animation on Xeon CPU☆130Updated 2 years ago
- Offical implement of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for talking head Video Generation☆237Updated 2 months ago
- Alternative to Flawless AI's TrueSync. Make lips in video match provided audio using the power of Wav2Lip and GFPGAN.☆128Updated 3 weeks ago
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)☆379Updated last year
- ☆81Updated 5 months ago
- ☆200Updated 2 years ago
- ☆38Updated 2 years ago
- [CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing☆452Updated last year
- Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".☆214Updated last year
- 🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮☆223Updated last year
- Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)☆359Updated 2 years ago
- Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆323Updated 2 years ago
- [ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting☆368Updated 9 months ago
- Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars☆392Updated 9 months ago
- [ICLR 2024] Generalizable and Precise Head Avatar from Image(s)☆342Updated last year
- High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN☆498Updated last year
- Full version of wav2lip-onnx including face alignment and face enhancement and more...☆150Updated 7 months ago
- This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"☆404Updated last year
- ☆43Updated 5 months ago
- This is a pytorch implementation of the following paper: AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections, SI…☆316Updated 9 months ago
- [CVPR 2023] MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation☆547Updated 2 years ago
- [ACM MM 2025] Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis☆653Updated 2 months ago
- Generate ARKit expression from audio in realtime☆175Updated 2 months ago
- [NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar☆546Updated 9 months ago