alpoktem / MachineDubLinks
Automatic audiovisual translation with lip-syncing
☆10Updated 5 years ago
Alternatives and similar repositories for MachineDub
Users that are interested in MachineDub are comparing it to the libraries listed below
Sorting:
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech☆453Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆362Updated 2 years ago
- Real-Time Lip Sync for Live 2D Animation☆142Updated 5 years ago
- ObamaNet : Photo-realistic lip-sync from audio (Unofficial port)☆239Updated 7 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 4 years ago
- The code for the paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"☆170Updated 2 years ago
- ☆130Updated 2 years ago
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 7 years ago
- ☆105Updated last year
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆611Updated last month
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆691Updated 9 months ago
- One Shot Voice Cloning base on Unet-TTS☆242Updated 3 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆110Updated 3 years ago
- ☆21Updated 3 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆33Updated 4 months ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆236Updated 3 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆648Updated last year
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 4 years ago
- Learning Lip Sync of Obama from Speech Audio☆66Updated 4 years ago
- lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based…☆126Updated 6 months ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108Updated last year
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆304Updated 3 years ago
- ☆34Updated 3 years ago
- Parallel data voice conversion based on pix2pix☆21Updated 5 years ago
- Audio driven video synthesis☆41Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆351Updated 3 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆229Updated 3 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆229Updated 3 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,071Updated 9 months ago
- Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3☆104Updated last year