CUNY-CL / wikipron
Massively multilingual pronunciation mining
☆338 · Updated 3 weeks ago
Alternatives and similar repositories for wikipron
Users interested in wikipron are comparing it to the libraries listed below:
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features. ☆252 · Updated 8 months ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings ☆84 · Updated 11 months ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet) ☆707 · Updated last week
- Grapheme to phoneme conversion with deep learning. ☆381 · Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆158 · Updated last week
- 🙊 software for creating speech recognition models. ☆159 · Updated 10 months ago
- Phonetisaurus G2P ☆471 · Updated 10 months ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆135 · Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆159 · Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆90 · Updated last year
- g2p: English Grapheme To Phoneme Conversion ☆849 · Updated 2 years ago
- Read, write, and manipulate Praat TextGrid files with Python ☆128 · Updated last year
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat ☆293 · Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆192 · Updated 8 months ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆242 · Updated 5 years ago
- CMU Wilderness Multilingual Speech Dataset ☆278 · Updated 6 years ago
- DeepSpeech based forced alignment tool ☆237 · Updated 4 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages ☆621 · Updated 11 months ago
- Charsiu: A neural phonetic aligner. ☆297 · Updated 2 years ago
- Linguistic processing for Common Voice ☆55 · Updated last year
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney ☆162 · Updated 9 months ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting … ☆327 · Updated last year
- Python interface for forced audio alignment using HTK and SoX ☆337 · Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆335 · Updated 11 months ago
- Praat textgrid manipulation in Python ☆52 · Updated 3 weeks ago
- A phoneme-allophone database for many languages ☆52 · Updated 4 years ago
- Universal multilingual automatic speech transcription into IPA ☆64 · Updated last month
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones. ☆33 · Updated last year
- Python module for syllabifying English ARPABET transcriptions ☆66 · Updated 6 years ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆531 · Updated 2 years ago