Rudrabha/Lip2Wav

This repository contains the code for our CVPR 2020 paper, "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis".

☆713

Alternatives and similar repositories for Lip2Wav

Users who are interested in Lip2Wav are comparing it to the libraries listed below.

joannahong / Lip2Wav-pytorch
A PyTorch implementation of Lip2Wav
☆50 · Oct 2, 2022 · Updated 3 years ago
Rudrabha / LipGAN
This repository contains the code for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…
☆616 · Jun 22, 2025 · Updated last year
Rudrabha / Wav2Lip
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…
☆13,118 · Jun 22, 2025 · Updated last year
Chris10M / Lip2Speech
A pipeline to read lips and generate speech for the read content, i.e., lip-to-speech synthesis.
☆95 · Jul 23, 2025 · Updated last year
mpc001 / Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…
☆438 · May 18, 2023 · Updated 3 years ago
ms-dot-k / Visual-Context-Attentional-GAN
PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS 2021)
☆25 · Mar 9, 2024 · Updated 2 years ago
Sindhu-Hegde / pseudo-visual-speech-denoising
Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021
☆108 · May 27, 2024 · Updated 2 years ago
bigpon / vcc20_baseline_cyclevae
Voice Conversion Challenge 2020 CycleVAE baseline system
☆131 · Oct 19, 2020 · Updated 5 years ago
facebookresearch / av_hubert
A self-supervised learning framework for audio-visual speech
☆996 · Dec 7, 2023 · Updated 2 years ago
dunbar12138 / Audiovisual-Synthesis
Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders
☆123 · Nov 21, 2022 · Updated 3 years ago
NVIDIA / flowtron
Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr…
☆895 · Jul 6, 2023 · Updated 3 years ago
Hangz-nju-cuhk / Talking-Face-Generation-DAVS
Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)
☆813 · May 11, 2021 · Updated 5 years ago
jixinya / EVP
Code for paper 'Audio-Driven Emotional Video Portraits'.
☆314 · Mar 16, 2022 · Updated 4 years ago
lelechen63 / Talking-head-Generation-with-Rhythmic-Head-Motion
☆209 · Mar 10, 2021 · Updated 5 years ago
grey-eye / talking-heads
Our implementation of "Few-Shot Adversarial Learning of Realistic Neural Talking Head Models" (Egor Zakharov et al.)
☆590 · Nov 22, 2022 · Updated 3 years ago
matthijsvk / TCDTIMITprocessing
Processing and extraction of face and mouth image files from the TCDTIMIT database
☆47 · Sep 22, 2020 · Updated 5 years ago
axelspringer / ForwardTacotron
⏩ Generating speech in a single forward pass without any attention!
☆578 · Mar 15, 2026 · Updated 4 months ago
joonson / syncnet_python
Out of time: automated lip sync in the wild
☆894 · Apr 17, 2026 · Updated 3 months ago
smeetrs / deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
☆244 · Feb 15, 2024 · Updated 2 years ago
mpc001 / Visual_Speech_Recognition_for_Multiple_Languages
Visual Speech Recognition for Multiple Languages
☆478 · Aug 17, 2023 · Updated 2 years ago
yiranran / Audio-driven-TalkingFace-HeadPose
Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (arXiv 2020) and "Predicting Personalize…
☆772 · Dec 15, 2023 · Updated 2 years ago
Rudrabha / 8X-Super-Resolution
This repository accompanies the paper "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…
☆16 · Aug 26, 2020 · Updated 5 years ago
michaelzhang-ai / Speech2Video
ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"
☆100 · Feb 27, 2026 · Updated 5 months ago
choijeongsoo / lip2speech-unit
[Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units
☆47 · Oct 26, 2024 · Updated last year
Hangz-nju-cuhk / Talking-Face_PC-AVS
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
☆959 · Jan 6, 2024 · Updated 2 years ago
auspicious3000 / SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆697 · Oct 23, 2024 · Updated last year
chenqi008 / V2C
PyTorch implementation for "V2C: Visual Voice Cloning"
☆34 · Jan 28, 2023 · Updated 3 years ago
TimoBolkart / voca
This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character me…
☆1,258 · Aug 20, 2024 · Updated last year
VIPL-Audio-Visual-Speech-Understanding / LipNet-PyTorch
The state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…
☆237 · Sep 21, 2022 · Updated 3 years ago
maum-ai / cotatron
Official code for Cotatron @ INTERSPEECH 2020
☆213 · Jul 25, 2024 · Updated 2 years ago
joonson / syncnet_trainer
Disentangled Speech Embeddings using Cross-Modal Self-Supervision
☆167 · Apr 12, 2020 · Updated 6 years ago
mpc001 / end-to-end-lipreading
PyTorch code for End-to-End Audiovisual Speech Recognition
☆183 · Nov 18, 2022 · Updated 3 years ago
Wendison / VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
☆361 · Apr 27, 2022 · Updated 4 years ago
ms-dot-k / Visual-Audio-Memory
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV 2021)
☆22 · Apr 11, 2022 · Updated 4 years ago
yanggeng1995 / EATS
A PyTorch implementation of EATS: End-to-End Adversarial Text-to-Speech
☆127 · Jul 16, 2020 · Updated 6 years ago
NVlabs / few-shot-vid2vid
PyTorch implementation for few-shot photorealistic video-to-video translation.
☆1,797 · Oct 27, 2021 · Updated 4 years ago
NVIDIA / mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆869 · Jul 22, 2023 · Updated 3 years ago
DinoMan / speech-driven-animation
☆960 · Sep 10, 2023 · Updated 2 years ago
karanvivekbhargava / obamanet
ObamaNet: Photo-realistic lip-sync from audio (Unofficial port)
☆237 · Mar 28, 2018 · Updated 8 years ago