rhasspy / gruutLinks
A tokenizer, text cleaner, and phonemizer for many human languages.
β325Updated 10 months ago
Alternatives and similar repositories for gruut
Users that are interested in gruut are comparing it to the libraries listed below
Sorting:
- Grapheme to phoneme conversion with deep learning.β397Updated last year
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ256Updated 2 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languagesβ169Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!β174Updated this week
- Segment an audio file and obtain utterance alignments. (Python package)β341Updated last year
- Multilingual G2P in 100 languagesβ355Updated 2 years ago
- PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speechβ341Updated 3 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretationβ558Updated 2 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ229Updated 3 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β325Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ150Updated last year
- Finetune VITS and MMS using HuggingFace's toolsβ163Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020β265Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languagesβ139Updated last year
- β260Updated 2 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, β¦β290Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.β364Updated last year
- DeepSpeech based forced alignment toolβ239Updated 4 years ago
- Collection of pretrained models for the Montreal Forced Alignerβ164Updated 2 months ago
- Charsiu: A neural phonetic aligner.β312Updated 2 years ago
- Various speech datasets made available to the publicβ130Updated 9 months ago
- πΈSTT integration examplesβ129Updated 2 years ago
- Model for recasing and repunctuating ASR transcriptsβ138Updated last year
- NeMo text processing for ASR and TTSβ369Updated last week
- Gecko - A Tool for Effective Annotation of Human Conversationsβ297Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings.β122Updated 9 months ago
- Universal Romanizer that can convert any unicode script to roman (latin) scriptβ221Updated last year
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Textβ244Updated 5 years ago
- [WIP] VoiceSmith makes training text to speech models easy.β225Updated 2 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speechβ467Updated last year