shivammehta25 / Neural-HMMLinks
Neural HMMs are all you need (for high-quality attention-free TTS)
☆159Updated last month
Alternatives and similar repositories for Neural-HMM
Users that are interested in Neural-HMM are comparing it to the libraries listed below
Sorting:
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- ☆163Updated 2 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆112Updated 3 years ago
- Official repository of https://doi.org/10.1109/TASLP.2022.3167258. More up-to-date code is in "refactor" branch.☆190Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 3 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆159Updated 2 months ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆261Updated 3 weeks ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation☆194Updated 3 years ago
- ☆80Updated last year
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆123Updated 3 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023☆227Updated 2 months ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆117Updated 2 years ago
- Pytorch implementation of "Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech"☆197Updated last year
- Official code for Wav2Seq☆95Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆210Updated 2 months ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆207Updated 2 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆91Updated 4 months ago
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.☆122Updated last year
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆127Updated 5 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆201Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆77Updated last year
- Official implementation of BVAE-TTS☆173Updated 2 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆46Updated 3 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 3 years ago
- A sequence-to-sequence voice conversion toolkit.☆102Updated last year
- A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis☆114Updated 4 years ago
- A PyTorch implementation of the universal neural vocoder☆67Updated 4 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago