Edresson / TTS
Deep learning for Text to Speech
☆26Updated 3 years ago
Alternatives and similar repositories for TTS:
Users that are interested in TTS are comparing it to the libraries listed below
- Open Source Text-To-Speech Portuguese Dataset☆159Updated 11 months ago
- VAE Tacotron 2, an alternative of GST Tacotron☆88Updated last year
- Python toolkit for speech processing☆68Updated 3 weeks ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Updated 4 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆140Updated last year
- Text Independent Speaker Verification Using GE2E Loss☆83Updated 6 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Fre-GAN: Adversarial Frequency-consistent Audio Synthesis☆101Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆190Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- [INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning☆80Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 2 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 3 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- ☆40Updated 2 years ago
- A pytroch implementation of the FB-MelGAN☆88Updated 4 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆72Updated 3 years ago
- An implement of GlowTTS model. Several modes are added: speaker embedding, prosody encoder(GST), and gradient reversal.☆53Updated 2 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Updated last year
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆83Updated last year
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆48Updated 8 months ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆29Updated 4 years ago
- Real-Time High-Fidelity Speech Synthesis without GPU☆74Updated 6 months ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 2 years ago
- Non-Parallel Voice Conversion with Cyclic Variational Autoencoder☆52Updated 4 years ago
- Alignment files of LibriTTS.☆61Updated 4 years ago