henchc / syllabipy
universal syllabification algorithms
☆44Updated last year
Related projects ⓘ
Alternatives and complementary repositories for syllabipy
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 5 years ago
- Python Finite-State Toolkit☆45Updated 2 weeks ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated 2 weeks ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- PHOIBLE data and development.☆121Updated 4 months ago
- Jason Riggle's chart of phonological features in JSON format + extras☆49Updated 4 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆33Updated 2 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆81Updated 6 months ago
- ☆22Updated 2 years ago
- pronunciation dictionaries for multiple languages☆83Updated 7 years ago
- Language data store and linguistic query API☆39Updated last month
- Cross-Linguistic Transcription Systems☆14Updated 7 months ago
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆221Updated 3 months ago
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- A repository containing links to useful phonological software☆11Updated last year
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- ☆10Updated 3 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆21Updated 2 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆62Updated 2 months ago
- A lexicon compiler for non-suffixational morphologies☆11Updated 4 months ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆43Updated last year
- ☆19Updated 3 years ago
- Acoustic distance measure for comparing pronunciations☆14Updated 2 years ago
- Phonological CorpusTools☆113Updated 3 weeks ago
- Python classes for the Buckeye Corpus☆21Updated 6 years ago
- A tool for automatic spelling normalization☆20Updated 3 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆30Updated 7 years ago