petronny / g2p
Pre-trained grapheme-to-phoneme (G2P) models
☆25Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for g2p
- Speech samples and code of BEdit-TTS☆32Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- ☆41Updated 4 years ago
- Chinese Text Normalization and Dataset☆81Updated 2 years ago
- multilingual speech aligner☆71Updated 11 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Python wrapper for kaldi's arpa2fst☆38Updated last year
- it's ASR decoder and make graph project☆32Updated 2 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆46Updated 4 months ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 2 years ago
- Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo☆17Updated 4 years ago
- ☆61Updated last year
- Neural network-based forced alignment with bidirectional attention mechanism☆70Updated 2 years ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆23Updated 2 months ago
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago
- CMU multilingual speech repository☆31Updated 2 years ago
- ☆21Updated 8 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- video cut powered by AI☆25Updated last year
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆29Updated 10 months ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆31Updated 4 months ago
- ☆22Updated 5 years ago
- ☆53Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆49Updated 2 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆49Updated 4 years ago
- ☆62Updated 2 years ago