abuccts / wikt2pron
A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format
☆33Updated 5 years ago
Alternatives and similar repositories for wikt2pron:
Users that are interested in wikt2pron are comparing it to the libraries listed below
- Python Finite-State Toolkit☆54Updated last month
- universal syllabification algorithms☆44Updated 2 years ago
- ☆10Updated 4 years ago
- ☆22Updated 3 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Python framework for processing Universal Dependencies data☆56Updated this week
- ☆19Updated 3 years ago
- Helsinki Finite-State Technology (library and application suite)☆129Updated last week
- Python implementation of Levenshtein distance and Levenshtein automata matching☆27Updated 5 years ago
- Python module for syllabifying English ARPABET transcriptions☆66Updated 6 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated 2 months ago
- Expected edit distance implementation using OpenFst tools☆11Updated 9 years ago
- Cross-Linguistic Transcription Systems☆13Updated 4 months ago
- 🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, …☆17Updated 9 months ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆191Updated 4 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Runnable morphological analysis tools from the UniMorph project☆15Updated 6 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 7 months ago
- Read-only unofficial mirror of Pynini☆17Updated 5 years ago
- linguistic data on the Yongning Na language☆7Updated last week
- Gamma Agreement in Python☆43Updated last year
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets☆16Updated this week
- linguistics tree drawing to SVG in python, aimed at Jupyter☆63Updated 8 months ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Proposed splits for the LREC Wikipron paper☆14Updated 5 years ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆33Updated last week
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated 10 months ago