This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
☆712Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for Lip2Wav
Users that are interested in Lip2Wav are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a PyTorch implementation of Lip2Wav☆50Oct 2, 2022Updated 3 years ago
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆615Jun 22, 2025Updated 9 months ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆12,916Jun 22, 2025Updated 9 months ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆94Jul 23, 2025Updated 8 months ago
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆433May 18, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Mar 9, 2024Updated 2 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆107May 27, 2024Updated last year
- Voice Conversion Challenge 2020 CycleVAE baseline system☆131Oct 19, 2020Updated 5 years ago
- A self-supervised learning framework for audio-visual speech☆981Dec 7, 2023Updated 2 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr…☆897Jul 6, 2023Updated 2 years ago
- Visual Speech Recognition for Multiple Languages☆465Aug 17, 2023Updated 2 years ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆813May 11, 2021Updated 4 years ago
- Code for paper 'Audio-Driven Emotional Video Portraits'.☆315Mar 16, 2022Updated 4 years ago
- ☆208Mar 10, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Sep 22, 2020Updated 5 years ago
- Our implementation of "Few-Shot Adversarial Learning of Realistic Neural Talking Head Models" (Egor Zakharov et al.)☆589Nov 22, 2022Updated 3 years ago
- ⏩ Generating speech in a single forward pass without any attention!☆580Mar 15, 2026Updated last month
- Out of time: automated lip sync in the wild☆881Updated this week
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆243Feb 15, 2024Updated 2 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆16Aug 26, 2020Updated 5 years ago
- Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalize…☆776Dec 15, 2023Updated 2 years ago
- [Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units☆47Oct 26, 2024Updated last year
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)☆959Jan 6, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"☆100Feb 27, 2026Updated last month
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆699Oct 23, 2024Updated last year
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- Pytorch implementation for “V2C: Visual Voice Cloning”☆34Jan 28, 2023Updated 3 years ago
- Pytorch code for End-to-End Audiovisual Speech Recognition☆184Nov 18, 2022Updated 3 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆360Apr 27, 2022Updated 3 years ago
- This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character me…☆1,259Aug 20, 2024Updated last year
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆122Nov 21, 2022Updated 3 years ago
- The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxi…☆235Sep 21, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆166Apr 12, 2020Updated 6 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆127Jul 16, 2020Updated 5 years ago
- ObamaNet : Photo-realistic lip-sync from audio (Unofficial port)☆237Mar 28, 2018Updated 8 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 4 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆867Jul 22, 2023Updated 2 years ago
- Pytorch implementation for few-shot photorealistic video-to-video translation.☆1,795Oct 27, 2021Updated 4 years ago
- Extension of Wav2Lip repository for processing high-quality videos.☆547Feb 7, 2023Updated 3 years ago