shawwn / Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆53Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for Real-Time-Voice-Cloning
- Pytorch implementation of Deepmind's WaveRNN model☆120Updated 5 years ago
- A gui to help make a text to speech dataset.☆18Updated last year
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu☆253Updated 3 years ago
- Non official project based on original /r/Deepfakes thread. Many thanks to him!☆15Updated 4 years ago
- This is a phonemic multilingual (Russian-English) Implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-S…☆51Updated 4 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆166Updated 4 years ago
- Stylizer Video to Drawing. Based on "Unpaired-Portrait-Drawing" repository.☆26Updated 4 years ago
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆249Updated 5 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆358Updated last year
- Simple text to phonemes converter for multiple languages☆20Updated last year
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆428Updated 3 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆227Updated 2 years ago
- Speech-conditioned face generation using Generative Adversarial Networks (ICASSP 2019)☆56Updated 2 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆50Updated 2 years ago
- Face detection and recognition library that focuses on speed and ease of use.☆34Updated 3 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Updated 6 years ago
- Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆208Updated 3 months ago
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.☆95Updated last year
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Voice Conversion using Cycle GAN's For Non-Parallel Data☆123Updated 5 years ago
- Two-stage GANs that generate fingerstyle guitarist images from audio.☆58Updated 6 years ago
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation☆61Updated 4 years ago
- This is PyTorch Implementation of Neural Style Transfer Algorithm which is modified for Audios.☆79Updated 2 years ago
- GitHub repo for my Tensorflow World hackathon submission☆19Updated last year
- (pytorch) multi speaker TTS,☆64Updated 5 years ago
- end-to-end voicebot that answers open domain questions.☆10Updated 3 years ago
- ☆64Updated 3 years ago