joannahong / Lip2Wav-pytorchLinks

a PyTorch implementation of Lip2Wav

☆51

Alternatives and similar repositories for Lip2Wav-pytorch

Users that are interested in Lip2Wav-pytorch are comparing it to the libraries listed below

Sorting:

naver-ai / facetts
☆59Updated 2 years ago
ms-dot-k / Visual-Context-Attentional-GAN
PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)
☆25Updated last year
walkoncross / voxceleb2-download-zyf
Tools for downloading VoxCeleb2 dataset
☆33Updated last year
Chris10M / Lip2Speech
A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.
☆93Updated 4 months ago
ms-dot-k / Lip-to-Speech-Synthesis-in-the-Wild
PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)
☆70Updated last year
zcxu-eric / AVA-AVD
☆47Updated 3 years ago
ms-dot-k / LRW_ID
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Updated 2 years ago
LUMIA-Group / Leveraging-Self-Supervised-Learning-for-AVSR
Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL…
☆68Updated 3 years ago
ahaliassos / raven
Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)
☆77Updated 9 months ago
vskadandale / vocalist
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
☆68Updated last year
joonson / syncnet_trainer
Disentangled Speech Embeddings using Cross-Modal Self-Supervision
☆165Updated 5 years ago
ttslr / StrengthNet
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
☆83Updated 3 years ago
DanielMengLiu / AudioVisualLip
☆23Updated last year
GalaxyCong / HPMDubbing
[CVPR 2023] Official code for paper: Learning to Dub Movies via Hierarchical Prosody Models.
☆110Updated last year
Dianezzy / ParaLip
Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code
☆109Updated 3 years ago
KimythAnly / AGAIN-VC
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…
☆114Updated 4 years ago
chenqi008 / V2C
Pytorch implementation for “V2C: Visual Voice Cloning”
☆32Updated 2 years ago
CODEJIN / AutoVC
☆30Updated 5 years ago
choijeongsoo / lip2speech-unit
[Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units
☆47Updated last year
biggytruck / SpeechSplit2
Official implementation of SpeechSplit2
☆133Updated 3 years ago
KunZhou9646 / Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
☆93Updated 3 years ago
KunZhou9646 / seq2seq-EVC
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…
☆86Updated 2 years ago
Moon0316 / T2A
Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023
☆86Updated 2 years ago
arxrean / LipRead-seq2seq
An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.
☆10Updated 5 years ago
Tinglok / CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
☆59Updated 3 years ago
BenoitWang / Speech_Emotion_Diarization
☆69Updated last year
X-LANCE / MSDWILD
[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
☆58Updated last year
zexupan / reentry
☆18Updated last year
zcxu-eric / Ego4d_TalkNet_ASD
☆20Updated 3 years ago
KunZhou9646 / controllable_evc_code
This is the code for controllable EVC framework for seen and unseen emotion generation.
☆45Updated 4 years ago