abuccts / wikt2pron
A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format
☆33Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for wikt2pron
- universal syllabification algorithms☆44Updated last year
- Python Finite-State Toolkit☆44Updated 3 months ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Gamma Agreement in Python☆43Updated 8 months ago
- ☆10Updated 3 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated this week
- ☆19Updated 3 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆33Updated 2 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 2 months ago
- Calculates the Word Error Rate between two text files☆20Updated last year
- Multilingual grapheme-to-phoneme conversion☆19Updated 6 years ago
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets☆14Updated last week
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago
- Language Acquisition Research Tools☆37Updated 7 months ago
- Python implementation of Levenshtein distance and Levenshtein automata matching☆27Updated 5 years ago
- Runnable morphological analysis tools from the UniMorph project☆14Updated 5 years ago
- Transform TMX to text☆29Updated last year
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- Cross-Linguistic Transcription Systems☆14Updated 6 months ago
- LingPy: Python library for quantitative tasks in historical linguistics☆124Updated 10 months ago
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- Fast Word Clustering Software☆74Updated 2 months ago
- ☆12Updated 8 years ago
- Expected edit distance implementation using OpenFst tools☆11Updated 9 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆219Updated 3 months ago
- PHOIBLE data and development.☆121Updated 4 months ago
- Automatically exported from code.google.com/p/m2m-aligner☆41Updated 8 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- Convert words to numbers☆20Updated 2 years ago