wswu / yawipa
A comprehensive and extensible Wiktionary parsing framework.
☆20 Updated 2 months ago
Related projects
Alternatives and complementary repositories for yawipa
- A lexicon compiler for non-suffixational morphologies ☆11 Updated 4 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) ☆27 Updated 3 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆36 Updated last year
- ☆19 Updated 3 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars ☆15 Updated 4 months ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies ☆14 Updated 4 years ago
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages. ☆34 Updated this week
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆15 Updated this week
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆24 Updated last year
- Central Alaskan Yup'ik FST morphological analyzer/generator ☆12 Updated last month
- The Grammar Matrix ☆12 Updated this week
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars ☆34 Updated 3 weeks ago
- A repository containing links to useful phonological software ☆11 Updated last year
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya ☆11 Updated 7 years ago
- A repository for the 2022 Inflection Shared Task ☆9 Updated 2 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files ☆27 Updated 5 years ago
- Cross-Linguistic Transcription Systems ☆14 Updated 6 months ago
- The Unicode Cookbook for Linguists ☆53 Updated 3 years ago
- Austronesian Comparative Dictionary ☆11 Updated last year
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- Latin BERT ☆56 Updated 4 months ago
- CLDF: Cross-Linguistic Data Formats - the specification ☆55 Updated 6 months ago
- A list of vocabulary lists ☆21 Updated 4 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates ☆43 Updated last year
- A cloud-based, open-source system for writing and publishing dictionaries. ☆86 Updated 10 months ago
- Python API to access glottolog/glottolog ☆28 Updated 2 weeks ago
- Collaborative data curation for Glottolog ☆152 Updated last week
- Runnable morphological analysis tools from the UniMorph project ☆14 Updated 5 years ago
- Improved Sentence Alignment in Linear Time and Space ☆163 Updated last year
- A modern, interlingual wordnet interface for Python ☆218 Updated this week