Rudrabha / Lip2WavView external linksLinks
This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"
☆713Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for Lip2Wav
Users that are interested in Lip2Wav are comparing it to the libraries listed below
Sorting:
- a PyTorch implementation of Lip2Wav☆50Oct 2, 2022Updated 3 years ago
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆613Jun 22, 2025Updated 7 months ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆12,823Jun 22, 2025Updated 7 months ago
- ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…☆431May 18, 2023Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Mar 9, 2024Updated last year
- Voice Conversion Challenge 2020 CycleVAE baseline system☆131Oct 19, 2020Updated 5 years ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆93Jul 23, 2025Updated 6 months ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108May 27, 2024Updated last year
- Our implementation of "Few-Shot Adversarial Learning of Realistic Neural Talking Head Models" (Egor Zakharov et al.)☆589Nov 22, 2022Updated 3 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr…☆900Jul 6, 2023Updated 2 years ago
- ⏩ Generating speech in a single forward pass without any attention!☆581Updated this week
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆816May 11, 2021Updated 4 years ago
- ☆208Mar 10, 2021Updated 4 years ago
- Code for paper 'Audio-Driven Emotional Video Portraits'.☆314Mar 16, 2022Updated 3 years ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- A self-supervised learning framework for audio-visual speech☆969Dec 7, 2023Updated 2 years ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆698Oct 23, 2024Updated last year
- Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalize…☆774Dec 15, 2023Updated 2 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆127Jul 16, 2020Updated 5 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆359Apr 27, 2022Updated 3 years ago
- Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"☆100Apr 8, 2021Updated 4 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Sep 22, 2020Updated 5 years ago
- Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)☆961Jan 6, 2024Updated 2 years ago
- Out of time: automated lip sync in the wild☆870Jan 23, 2024Updated 2 years ago
- Pytorch implementation for few-shot photorealistic video-to-video translation.☆1,798Oct 27, 2021Updated 4 years ago
- ☆967Sep 10, 2023Updated 2 years ago
- Pytorch code for End-to-End Audiovisual Speech Recognition☆184Nov 18, 2022Updated 3 years ago
- This codebase demonstrates how to synthesize realistic 3D character animations given an arbitrary speech signal and a static character me…☆1,247Aug 20, 2024Updated last year
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,091Oct 23, 2024Updated last year
- streaming attention networks for end-to-end automatic speech recognition☆55May 6, 2020Updated 5 years ago
- Extension of Wav2Lip repository for processing high-quality videos.☆549Feb 7, 2023Updated 3 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversion☆340Jul 6, 2023Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆866Jul 22, 2023Updated 2 years ago
- Speech-conditioned face generation using Generative Adversarial Networks☆88Dec 8, 2022Updated 3 years ago
- Visual Speech Recognition for Multiple Languages☆458Aug 17, 2023Updated 2 years ago
- The Implementation of FastSpeech based on pytorch.☆880Jul 6, 2023Updated 2 years ago
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆240Feb 15, 2024Updated last year
- PPG-Based Voice Conversion☆347Jul 22, 2022Updated 3 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆122Nov 21, 2022Updated 3 years ago