abuccts / wikt2pron
A Python toolkit that converts pronunciations in the enwiktionary XML dump to cmudict format
☆33 Updated 5 years ago
Related projects
Alternatives and complementary repositories for wikt2pron
- universal syllabification algorithms ☆44 Updated last year
- Python Finite-State Toolkit ☆45 Updated 2 weeks ago
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets ☆15 Updated 3 weeks ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆15 Updated 2 weeks ago
- ☆10 Updated 3 years ago
- Multilingual grapheme-to-phoneme conversion ☆19 Updated 6 years ago
- Small-vocabulary sequence-to-sequence generation with optional feature conditioning ☆31 Updated this week
- Morfessor EM+Prune ☆10 Updated 4 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me… ☆40 Updated 2 months ago
- Utilities for manipulating finite state transducers with the OpenFst library. ☆30 Updated 7 years ago
- Jason Riggle's chart of phonological features in JSON format + extras ☆49 Updated 4 months ago
- Runnable morphological analysis tools from the UniMorph project ☆14 Updated 6 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict). ☆62 Updated 2 months ago
- Unicode Standard tokenization routines and orthography profile segmentation ☆33 Updated 2 years ago
- Gamma Agreement in Python ☆43 Updated 8 months ago
- Transform TMX to text ☆29 Updated 2 years ago
- Cross-Linguistic Transcription Systems ☆14 Updated 7 months ago
- pronunciation dictionaries for multiple languages ☆83 Updated 7 years ago
- Python module for syllabifying English ARPABET transcriptions ☆64 Updated 5 years ago
- bilingual dictionary extractor from parallel corpora ☆22 Updated 10 years ago
- Python framework for processing Universal Dependencies data ☆57 Updated this week
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features. ☆221 Updated 3 months ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions ☆25 Updated 4 years ago
- Tool to fix bitexts and tag near-duplicates for removal ☆29 Updated 3 months ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars ☆15 Updated 5 months ago
- LingPy: Python library for quantitative tasks in historical linguistics ☆125 Updated 11 months ago
- Expected edit distance implementation using OpenFst tools ☆11 Updated 9 years ago
- ☆12 Updated 8 years ago
- A simple neural truecaser written in pytorch and allennlp. ☆32 Updated 5 months ago
- Calculates the Word Error Rate between two text files ☆20 Updated 2 years ago