VisionBrain / Neural_Voice_Cloning
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
☆16Updated 4 years ago
Alternatives and similar repositories for Neural_Voice_Cloning:
Users that are interested in Neural_Voice_Cloning are comparing it to the libraries listed below
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆56Updated 5 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- WaveNet Vocoder Samples☆23Updated 5 years ago
- Deep CNN networks for Speech Synthesis☆49Updated 7 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Ossian: A simple language-independent Text-to-speech frontend☆17Updated 6 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- (pytorch) multi speaker TTS,☆67Updated 5 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"☆11Updated 5 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 2 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Mellotron singing synthesizer using CPU☆13Updated last year
- An implement of SPEECHSPLIT☆15Updated 4 years ago
- It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater☆11Updated 5 years ago
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆10Updated last month
- single channel speech separation for music vocal and accompany separate、voice reduce noise☆13Updated 5 years ago
- A tensorflow based implementation of DeepVoice3 https://arxiv.org/abs/1710.07654☆13Updated 6 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆25Updated 2 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- An implementation of Tacotron2 (excluding WaveNet-vocoder) in TensorFlow.☆18Updated 6 years ago
- Demo and samples for universal speech translator☆23Updated 2 years ago
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis.☆10Updated 6 years ago
- Tacotron2 with BERT examples☆10Updated 5 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 6 years ago
- Segmentation algorithms adapted for multitrack pianorolls☆10Updated 6 years ago