Edresson / SC-GlowTTS
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
☆106Updated 3 years ago
Related projects: ⓘ
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆89Updated 2 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆126Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- ☆74Updated 2 years ago
- asr2k☆48Updated 3 months ago
- Unofficial Pytorch Implementation of WaveGrad2☆111Updated 3 years ago
- Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021☆39Updated 3 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆84Updated 3 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆151Updated last month
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆115Updated 2 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆188Updated 2 years ago
- ☆56Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆45Updated last year
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- ☆110Updated 2 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆109Updated 2 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆43Updated 4 years ago
- ☆75Updated 3 months ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 2 years ago
- Pytorch implementation of "Efficienttts: an efficient and high-quality text-to-speech architecture"☆115Updated 2 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆139Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 3 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆138Updated last year
- VAE Tacotron 2, an alternative of GST Tacotron☆85Updated last year
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆78Updated 2 years ago
- ☆31Updated 2 weeks ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆144Updated 2 years ago