VisionBrain / Neural_Voice_Cloning
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
☆16 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for Neural_Voice_Cloning
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆57 · Updated 5 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions). ☆35 · Updated last year
- Ossian: A simple language-independent Text-to-speech frontend ☆17 · Updated 6 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples ☆23 · Updated 4 years ago
- WaveNet Vocoder Samples ☆23 · Updated 5 years ago
- Simple text to phonemes converter for multiple languages ☆20 · Updated 2 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox ☆13 · Updated 6 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 · Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 · Updated last year
- A neural network, written in TensorFlow, for filtering a target speaker's voice from audio ☆21 · Updated 6 years ago
- C++ Implementation of the Information Bottleneck System ☆23 · Updated 5 years ago
- Source code for INTERSPEECH2020 ☆11 · Updated 4 years ago
- Demo and samples for universal speech translator ☆22 · Updated 2 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in TensorFlow. ☆26 · Updated 6 years ago
- A Text2Speech Engine built in PyTorch. ☆11 · Updated 5 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆24 · Updated last year
- Interface for Controllable Expressive Talking Machine ☆38 · Updated 10 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] ☆24 · Updated 2 years ago
- PyTorch-based speaker embedding model ☆15 · Updated 7 months ago
- Interface for using TTS and vocoder models in the form of a text editor ☆19 · Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆46 · Updated last year
- (PyTorch) Multi-speaker TTS ☆65 · Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation ☆38 · Updated 4 years ago
- **ICASSP 2022** "Toward Degradation-Robust Voice Conversion": Using speech enhancement and end-to-end denoising training to improve degrada… ☆23 · Updated 2 years ago
- A TensorFlow-based implementation of DeepVoice3 (https://arxiv.org/abs/1710.07654) ☆13 · Updated 6 years ago
- TensorFlow implementation of NVIDIA WaveGlow ☆41 · Updated 5 years ago