roedoejet / g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆147Updated 3 weeks ago
Alternatives and similar repositories for g2p:
Users that are interested in g2p are comparing it to the libraries listed below
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆153Updated last year
- Multilingual G2P in 100 languages☆298Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆131Updated 10 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆86Updated last year
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆188Updated 5 months ago
- Collection of pretrained models for the Montreal Forced Aligner☆127Updated 7 months ago
- A sequence-to-sequence voice conversion toolkit.☆93Updated 7 months ago
- ☆79Updated 8 months ago
- ☆111Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA☆58Updated 5 months ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆122Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles☆143Updated 8 months ago
- Charsiu: A neural phonetic aligner.☆289Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆64Updated 5 months ago
- Reference-aware automatic speech evaluation toolkit☆142Updated 2 months ago
- multilingual speech aligner☆72Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆40Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated last year
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- MOS score prediction by fine-tuned wav2vec2.0 model☆151Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆24Updated this week
- Read, write, and manipulate Praat TextGrid files with Python☆126Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆143Updated 11 months ago
- This is the M-AILABS Speech Dataset☆40Updated 2 months ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆240Updated 5 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Unofficial implementation of miipher☆118Updated 9 months ago
- ☆34Updated 5 months ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆146Updated 2 years ago