Kyubyong / g2pLinks
g2p: English Grapheme To Phoneme Conversion
☆909Updated 3 years ago
Alternatives and similar repositories for g2p
Users that are interested in g2p are comparing it to the libraries listed below
Sorting:
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆691Updated last year
- Grapheme to phoneme conversion with deep learning.☆419Updated 2 years ago
- Simple text to phones converter for multiple languages☆1,501Updated last year
- Phonetisaurus G2P☆504Updated last year
- Large, modern dataset for speech recognition☆716Updated last year
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆700Updated 3 years ago
- The Implementation of FastSpeech based on pytorch.☆880Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆480Updated 5 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,382Updated last year
- List of speech synthesis papers.☆1,060Updated 2 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆569Updated 2 years ago
- Tools for handling multimodal data in machine learning projects.☆1,105Updated last week
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆538Updated 3 years ago
- A collection of links and notes on forced alignment tools☆935Updated 4 years ago
- A Python wrapper for the high-quality vocoder "World"☆779Updated last year
- DeepSpeech based forced alignment tool☆239Updated 5 years ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)☆790Updated last month
- Unsupervised Speech Decomposition Via Triple Information Bottleneck☆698Updated last year
- End-to-end ASR/LM implementation with PyTorch☆594Updated 4 years ago
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"☆691Updated 2 years ago
- A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis☆374Updated 3 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆595Updated 4 years ago
- Python interface for forced audio alignment using HTK and SoX☆348Updated 5 years ago
- Command line utility for forced alignment using Kaldi☆1,727Updated 2 weeks ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆362Updated 4 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆477Updated last year
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Updated 2 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆520Updated 2 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆366Updated 7 years ago
- Charsiu: A neural phonetic aligner.☆327Updated 3 years ago