VisionBrain / Neural_Voice_CloningLinks
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
☆17Updated 4 years ago
Alternatives and similar repositories for Neural_Voice_Cloning
Users that are interested in Neural_Voice_Cloning are comparing it to the libraries listed below
Sorting:
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆24Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 7 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆27Updated 4 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- Tools for working with the CMU Pronunciation Dictionary☆35Updated 7 years ago
- Demo and samples for universal speech translator☆23Updated 2 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Deep CNN networks for Speech Synthesis☆49Updated 7 years ago
- (pytorch) multi speaker TTS,☆68Updated 5 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆40Updated 4 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆26Updated 3 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 3 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆14Updated 6 years ago
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- A Text2Speech Engine built in Pytorch.☆12Updated 6 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago
- A Generative Adversarial Network for Shakuhachi Music☆14Updated 6 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- ASR project with pytorch-lightning☆20Updated 3 months ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago