Edresson / TTS
Deep learning for Text to Speech
☆27 Updated 4 years ago
Alternatives and similar repositories for TTS:
Users who are interested in TTS are comparing it to the libraries listed below.
- Open Source Text-To-Speech Portuguese Dataset ☆161 Updated last year
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆19 Updated 3 years ago
- Interface for Controllable Expressive Talking Machine ☆38 Updated last year
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN) ☆61 Updated 4 years ago
- Python toolkit for speech processing ☆68 Updated this week
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search ☆86 Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆102 Updated 2 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 Updated 5 years ago
- Text Independent Speaker Verification Using GE2E Loss ☆84 Updated 6 years ago
- Constrained Permutation Invariant Training, Speech Separation ☆47 Updated 4 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19) ☆143 Updated last year
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder ☆64 Updated 6 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding" ☆239 Updated 4 years ago
- A sequence-to-sequence voice conversion toolkit. ☆96 Updated 9 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 Updated last year
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future. ☆154 Updated 3 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs ☆32 Updated 6 years ago
- End-to-end spoken language identification out of the box. ☆48 Updated 4 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition. ☆40 Updated 3 years ago
- Forced Alignments for Common Voice ☆31 Updated 4 years ago
- Transformer-based online speech recognition system with TensorFlow 2 ☆26 Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆83 Updated last year
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis" ☆166 Updated last year
- Filtering and Noise Adding Tool ☆29 Updated 2 years ago
- VoxLingua107 recipe for SpeechBrain ☆13 Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆89 Updated 4 years ago
- Clustering-based methods for overlapping diarization ☆80 Updated last year
- VAE Tacotron 2, an alternative of GST Tacotron ☆88 Updated last year
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models" ☆11 Updated 6 years ago