Kyubyong / speaker_adapted_tts
Making a TTS model with 1 minute of speech samples within 10 minutes
☆184Updated 7 years ago
Alternatives and similar repositories for speaker_adapted_tts:
Users that are interested in speaker_adapted_tts are comparing it to the libraries listed below
- Speech Recognition Using Tacotron☆163Updated 7 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆205Updated 6 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- A method to generate speech across multiple speakers☆872Updated 6 years ago
- End-2-end speech synthesis with recurrent neural networks☆226Updated last year
- RNN-based generative models for speech.☆611Updated 7 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆309Updated 6 years ago
- C++ Code to run waveglow inference in cuda☆130Updated 5 years ago
- Tensorflow Implementation of Deep Voice 3☆452Updated 7 years ago
- Implementation of Google's Tacotron in TensorFlow☆236Updated 6 years ago
- A WaveNet-based vocoder for fast inference☆162Updated 6 years ago
- An opensource speech-to-text software written in tensorflow☆158Updated 2 years ago
- Deep Learning-based Voice Conversion system☆120Updated 2 years ago
- A Pytorch Implementation of ClariNet☆292Updated 5 years ago
- Identify a spoken language using artificial intelligence (LID).☆123Updated 6 years ago
- Upsample speech audio in wav format using deep learning☆192Updated 7 years ago
- Deep Voice Real-time Neural TTS System☆160Updated 8 years ago
- Tacotron 2 implementation☆87Updated 7 years ago
- PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)☆516Updated 4 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆367Updated 6 years ago
- Speech-to-text based on wav2letter built for transfer learning☆97Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆470Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 11 months ago
- Audio style transfer AI☆152Updated 5 months ago
- Speech recognition software where the neural net is trained with TensorFlow and GMM training and decoding is done in Kaldi☆170Updated 8 years ago
- Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.☆540Updated 3 weeks ago
- Wavenet and its applications with Tensorflow☆55Updated 6 years ago
- AI Drums:☆3Updated 6 years ago