shawwn / Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆53Updated 5 years ago
Alternatives and similar repositories for Real-Time-Voice-Cloning:
Users that are interested in Real-Time-Voice-Cloning are comparing it to the libraries listed below
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆358Updated last year
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu☆253Updated 3 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆56Updated 5 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆167Updated 4 years ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆432Updated 3 years ago
- Keras implementations of Tacotron-2☆27Updated 4 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- A gui to help make a text to speech dataset.☆18Updated 2 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆12Updated 4 years ago
- A version of Obamanet that you won't go insane setting up.☆17Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆80Updated 7 months ago
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Classifying 10 different categories of Sound using Deep Learning.☆25Updated 6 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- Audio Classification using Image Classification☆49Updated 4 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆120Updated 2 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Updated 6 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆46Updated 2 years ago
- Animate image in real time using First Order Motion Model for Image Animation☆57Updated 11 months ago
- Identifying people from small audio fragments☆170Updated 4 years ago
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation☆60Updated 4 years ago
- Real-Time Lip Sync for Live 2D Animation☆135Updated 5 years ago
- Text to Speech with PyTorch (English and Mongolian)☆185Updated 3 months ago
- Non official project based on original /r/Deepfakes thread. Many thanks to him!☆15Updated 4 years ago