☆141Oct 18, 2023Updated 2 years ago
Alternatives and similar repositories for GameTTS
Users that are interested in GameTTS are comparing it to the libraries listed below
- Viterbi decoding in PyTorch☆41Sep 10, 2025Updated 5 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Mar 24, 2023Updated 2 years ago
- Prosody and Pronunciation Modification Network☆63May 5, 2025Updated 10 months ago
- A TensorFlow-based implementation of DeepVoice3 (https://arxiv.org/abs/1710.07654) ☆13Jun 5, 2018Updated 7 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆67Jan 7, 2023Updated 3 years ago
- Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…☆17Apr 27, 2023Updated 2 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆36Jan 16, 2021Updated 5 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆36Mar 31, 2023Updated 2 years ago
- ☆18Jan 17, 2022Updated 4 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆291Apr 6, 2023Updated 2 years ago
- PyTorch model for contextless phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Sep 10, 2021Updated 4 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- In this repository, I try to combine k2 with SpeechBrain to decode accurately and quickly.☆16Jun 17, 2022Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- ICASSP 2023 Accepted☆190May 6, 2024Updated last year
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- A TensorFlow implementation of WaveGlow: A Flow-based Generative Network for Speech Synthesis☆20Oct 23, 2019Updated 6 years ago
- 🐸TTS recipes for different datasets☆86Jul 26, 2022Updated 3 years ago
- Scripts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Nov 25, 2025Updated 3 months ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Jul 31, 2023Updated 2 years ago
- ☆24Sep 27, 2022Updated 3 years ago
- TensorFlow implementation of "FloWaveNet: A Generative Flow for Raw Audio"☆25Apr 19, 2019Updated 6 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- 56 language, 1 model Multilingual ASR☆24Jul 25, 2021Updated 4 years ago
- ☆67Aug 16, 2023Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Non Parallel Voice Conversion based on VITS☆24Mar 31, 2023Updated 2 years ago
- This is an unofficial implementation of Universal MelGAN according to https://arxiv.org/abs/2011.09631 ☆23Aug 15, 2022Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆332Nov 15, 2024Updated last year
- ☆10Nov 10, 2022Updated 3 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year