dmort27 / epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
☆671Updated 4 months ago
Alternatives and similar repositories for epitran:
Users that are interested in epitran are comparing it to the libraries listed below
- Massively multilingual pronunciation mining☆327Updated last month
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆231Updated 5 months ago
- g2p: English Grapheme To Phoneme Conversion☆831Updated 2 years ago
- Simple text to phones converter for multiple languages☆1,279Updated 3 months ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆585Updated 8 months ago
- Grapheme to phoneme conversion with deep learning.☆367Updated last year
- Phonetisaurus G2P☆457Updated 7 months ago
- CMU Wilderness Multilingual Speech Dataset☆273Updated 5 years ago
- A tool for automatic phoneme transcription☆157Updated last year
- 🙊 software for creating speech recognition models.☆154Updated 7 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆520Updated last year
- Python interface for forced audio alignment using HTK and SoX☆334Updated 4 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆468Updated 4 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆240Updated 5 years ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆320Updated last year
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆286Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆131Updated 9 months ago
- DeepSpeech based forced alignment tool☆235Updated 4 years ago
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)☆669Updated 2 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆295Updated 2 months ago
- Read, write, and manipulate Praat TextGrid files with Python☆126Updated last year
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆359Updated 3 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆486Updated last year
- CMU US English Dictionary☆642Updated last month
- Large, modern dataset for speech recognition☆656Updated 10 months ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆169Updated 5 months ago
- Converts English text to IPA notation☆371Updated last year
- Command line utility for forced alignment using Kaldi☆1,388Updated last month
- A collection of links and notes on forced alignment tools☆883Updated 3 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆144Updated this week