abuccts / wikt2pron
A Python toolkit for converting pronunciations in the enwiktionary XML dump to cmudict format
☆33 (updated 5 years ago)
Alternatives and similar repositories for wikt2pron:
Users who are interested in wikt2pron compare it to the repositories listed below.
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets ☆14 (updated 3 weeks ago)
- Python Finite-State Toolkit ☆50 (updated last month)
- universal syllabification algorithms ☆43 (updated 2 years ago)
- Calculates the Word Error Rate between two text files ☆20 (updated 2 years ago)
- LingPy: Python library for quantitative tasks in historical linguistics ☆128 (updated last year)
- ☆10 (updated 3 years ago)
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars ☆15 (updated 8 months ago)
- ☆22 (updated 2 years ago)
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆41 (updated 2 years ago)
- Unicode Standard tokenization routines and orthography profile segmentation ☆35 (updated this week)
- Gamma Agreement in Python ☆43 (updated 11 months ago)
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆15 (updated this week)
- linguistic data on the Yongning Na language ☆7 (updated this week)
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict). ☆61 (updated last month)
- phone inventory library ☆16 (updated last year)
- Jason Riggle's chart of phonological features in JSON format + extras ☆51 (updated 7 months ago)
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me… ☆40 (updated 5 months ago)
- SegBo: A database of borrowed sounds in the world's languages ☆16 (updated 11 months ago)
- Multilingual grapheme-to-phoneme conversion ☆20 (updated 6 years ago)
- Expected edit distance implementation using OpenFst tools ☆11 (updated 9 years ago)
- Python implementation of Levenshtein distance and Levenshtein automata matching ☆27 (updated 5 years ago)
- Python framework for processing Universal Dependencies data ☆55 (updated 2 weeks ago)
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 (updated last year)
- ☆19 (updated 3 years ago)
- Python wrapper for phonetisaurus grapheme to phoneme tool ☆12 (updated 3 years ago)
- English web corpus with 4M tokens and several annotation types ☆26 (updated last year)
- ☆12 (updated 2 years ago)
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning ☆33 (updated last week)
- Breaks a word into syllables using an LSTM-based neural network. ☆19 (updated last year)
- Automatically exported from code.google.com/p/m2m-aligner ☆42 (updated 8 years ago)