rhasspy / gruut
A tokenizer, text cleaner, and phonemizer for many human languages.
★326 · Updated 11 months ago
Alternatives and similar repositories for gruut
Users who are interested in gruut are comparing it to the libraries listed below.
- Grapheme to phoneme conversion with deep learning. ★404 · Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ★259 · Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ★177 · Updated last week
- Phoneme tokenizer and grapheme-to-phoneme model for 8k languages ★170 · Updated 2 years ago
- Multilingual G2P in 100 languages ★360 · Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ★342 · Updated last year
- Simplified diarization pipeline using some pretrained models: audio file to diarized segments in a few lines of code ★151 · Updated last year
- PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ★229 · Updated 3 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ★560 · Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools ★168 · Updated last year
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech ★341 · Updated 3 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … ★290 · Updated 2 years ago
- (no description) ★261 · Updated 2 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration… ★326 · Updated 3 years ago
- NeMo text processing for ASR and TTS ★379 · Updated last week
- 🐸TTS recipes for different datasets ★86 · Updated 3 years ago
- 🐸STT integration examples ★129 · Updated 3 years ago
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script ★226 · Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages ★140 · Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ★261 · Updated 9 months ago
- Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889) ★278 · Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020 ★265 · Updated 3 years ago
- DeepSpeech based forced alignment tool ★239 · Updated 4 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus ★217 · Updated 3 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 ★242 · Updated 4 months ago
- Live speech recognition using Facebook's wav2vec 2.0 model. ★372 · Updated last year
- [WIP] VoiceSmith makes training text to speech models easy. ★226 · Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling ★190 · Updated 3 years ago
- (no description) ★378 · Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ★95 · Updated last year