Edresson / TTSLinks
Deep learning for Text to Speech
☆27Updated 4 years ago
Alternatives and similar repositories for TTS
Users that are interested in TTS are comparing it to the libraries listed below
Sorting:
- Open Source Text-To-Speech Portuguese Dataset☆169Updated last year
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- Implementation code of non-parallel sequence-to-sequence VC☆248Updated 2 years ago
- Python toolkit for speech processing☆69Updated last week
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆120Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆190Updated 3 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆132Updated last year
- ☆80Updated last year
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆106Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆210Updated 3 weeks ago
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 11 months ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆83Updated 2 years ago
- A pytroch implementation of the FB-MelGAN☆89Updated 5 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆34Updated 4 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL☆101Updated last year
- This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance No…☆115Updated 4 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 4 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system☆132Updated 4 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.☆154Updated 3 years ago
- Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper: https://arxiv.org/abs/2110.09…☆72Updated 2 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆100Updated 11 months ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆194Updated 3 years ago