abuccts / wikt2pron
A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format
☆33Updated 5 years ago
Alternatives and similar repositories for wikt2pron:
Users that are interested in wikt2pron are comparing it to the libraries listed below
- Python Finite-State Toolkit☆53Updated last week
- universal syllabification algorithms☆43Updated 2 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated this week
- ☆19Updated 3 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated 2 weeks ago
- ☆10Updated 3 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆25Updated last year
- Proposed splits for the LREC Wikipron paper☆14Updated 4 years ago
- Python framework for processing Universal Dependencies data☆55Updated this week
- A guide to building language technology in new languages.☆58Updated 3 years ago
- Automatically exported from code.google.com/p/m2m-aligner☆42Updated 8 years ago
- Multilingual grapheme-to-phoneme conversion☆20Updated 7 years ago
- Python implementation of Levenshtein distance and Levenshtein automata matching☆27Updated 5 years ago
- ☆22Updated 2 years ago
- Jason Riggle's chart of phonological features in JSON format + extras☆53Updated 8 months ago
- Calculates the Word Error Rate between two text files☆20Updated 2 years ago
- ☆12Updated 9 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆33Updated this week
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 6 months ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆16Updated 8 months ago
- Gamma Agreement in Python☆43Updated last year
- These are lists for a variety of languages containing words that are distinctive to each language.☆36Updated 2 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated last month
- Utilities for manipulating finite state transducers with the OpenFst library.☆31Updated 7 years ago
- phone inventory library☆16Updated last year
- Cross-Linguistic Transcription Systems☆14Updated 2 months ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆17Updated 2 years ago
- PHOIBLE data and development.☆122Updated 8 months ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆191Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories.☆82Updated 3 years ago