Dianezzy / ParaLip
Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code
☆106 Updated 2 years ago
Related projects
Alternatives and complementary repositories for ParaLip
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023 ☆83 Updated last year
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices ☆61 Updated 7 months ago
- ☆19 Updated 2 years ago
- Talking Head from Speech Audio using a Pre-trained Image Generator ☆23 Updated 6 months ago
- demo page https://MingjieChen.github.io/dygan-vc ☆67 Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023) ☆65 Updated 8 months ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021) ☆22 Updated 8 months ago
- The project page repo for Neural Dubber. ☆29 Updated last year
- PyTorch Implementation of Multi-Singer (ACM-MM'21) ☆139 Updated 2 years ago
- ☆48 Updated last year
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory ☆33 Updated 2 years ago
- Demo for 2022 ICASSP ☆64 Updated 2 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision ☆154 Updated 4 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation ☆190 Updated 2 years ago
- Official implementation of Multi-Stage Multi-Codebook (MSMC) TTS ☆162 Updated 7 months ago
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No… ☆111 Updated 3 years ago
- Learning Lip Sync of Obama from Speech Audio ☆67 Updated 4 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code ☆197 Updated 2 years ago
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024). ☆23 Updated 7 months ago
- Tools for downloading VoxCeleb2 dataset ☆26 Updated 8 months ago
- A 16kHz implementation of HiFi-GAN for soft-vc. ☆93 Updated last year
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆158 Updated 2 years ago
- ☆28 Updated 4 years ago
- Official Implementation of StyleTTS-VC ☆164 Updated last year
- Code for paper 'EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model' ☆186 Updated last year
- ☆32 Updated 2 years ago
- Official implementation of SpeechSplit2 ☆128 Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆81 Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆119 Updated 2 years ago
- An improved version of APB2Face: Real-Time Audio-Guided Multi-Face Reenactment ☆82 Updated 3 years ago