Edresson / SC-GlowTTS
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
☆107, updated 3 years ago
Alternatives and similar repositories for SC-GlowTTS:
Users who are interested in SC-GlowTTS are comparing it to the repositories listed below.
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation. ☆88, updated 2 years ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm" ☆131, updated last year
- A PyTorch implementation of EATS: End-to-End Adversarial Text-to-Speech ☆128, updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101, updated last year
- Incorporating a KenLM language model with the HuggingFace implementation of the Wav2Vec2CTC model using beam search decoding ☆74, updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder ☆117, updated 2 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS) ☆158, updated 2 weeks ago
- Official PyTorch implementation of Speaker Conditional WaveRNN ☆110, updated 2 years ago
- asr2k ☆49, updated 9 months ago
- Unofficial PyTorch Implementation of WaveGrad2 ☆112, updated 3 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl… ☆159, updated 3 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19) ☆140, updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver… ☆89, updated 4 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 ☆212, updated last year
- A repository for benchmarking neural vocoders by their quality and speed. ☆208, updated this week
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis ☆56, updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆57, updated 2 years ago
- VAE Tacotron 2, an alternative to GST Tacotron ☆88, updated last year
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g… ☆146, updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34, updated 3 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se… ☆84, updated 2 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60, updated 2 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system ☆133, updated 4 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder ☆170, updated 7 months ago
- Implementation of AlignTTS ☆76, updated last year
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020) ☆79, updated 3 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis" ☆168, updated last year