Edresson / TTSLinks
Deep learning for Text to Speech
☆27Updated 4 years ago
Alternatives and similar repositories for TTS
Users that are interested in TTS are comparing it to the libraries listed below
Sorting:
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆145Updated 2 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- Open Source Text-To-Speech Portuguese Dataset☆173Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆143Updated 2 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search☆92Updated 4 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆155Updated 4 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆123Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 4 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆106Updated 4 years ago
- Python toolkit for speech processing☆69Updated 3 weeks ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- Implementation code of non-parallel sequence-to-sequence VC☆248Updated 2 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆82Updated 3 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- A pytroch implementation of the FB-MelGAN☆89Updated 5 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Updated 2 years ago
- ☆40Updated 3 years ago
- ☆80Updated 3 weeks ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- ☆260Updated 2 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆34Updated 6 years ago
- Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.☆148Updated 3 years ago
- Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention☆202Updated 4 years ago
- Online streaming speaker change detection model in Pytorch☆42Updated 2 years ago