roedoejet / g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆154Updated last week
Alternatives and similar repositories for g2p:
Users that are interested in g2p are comparing it to the libraries listed below
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆156Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆132Updated 11 months ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆127Updated 2 years ago
- Multilingual G2P in 100 languages☆309Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆68Updated 6 months ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆192Updated 6 months ago
- Predicts the level of noise and reverberation on your audiofiles☆144Updated 9 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆87Updated last year
- Charsiu: A neural phonetic aligner.☆295Updated 2 years ago
- Collection of pretrained models for the Montreal Forced Aligner☆133Updated 8 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆25Updated last month
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆147Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆33Updated last year
- Unofficial implementation of miipher☆120Updated 11 months ago
- Various speech datasets made available to the public☆114Updated 3 months ago
- Universal multilingual automatic speech transcription into IPA☆62Updated 3 weeks ago
- High-Fidelity Neural Phonetic Posteriorgrams☆106Updated 3 weeks ago
- Reference-aware automatic speech evaluation toolkit☆144Updated 3 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆98Updated last month
- ☆112Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models☆216Updated 11 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆40Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS)☆158Updated 2 weeks ago
- Praat textgrid manipulation in Python☆52Updated last year
- UTokyo-SaruLab MOS Prediction System☆155Updated 3 weeks ago
- ☆80Updated 9 months ago
- multilingual speech aligner☆72Updated last year
- This is the M-AILABS Speech Dataset☆46Updated 3 months ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago