colingoldberg / morphemes
Common English morphemes, organized for automated access.
☆9Updated 6 years ago
Alternatives and similar repositories for morphemes
Users who are interested in morphemes are comparing it to the libraries listed below.
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Python Finite-State Toolkit☆56Updated this week
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆30Updated 5 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆49Updated 2 years ago
- The Language Independent Intelligent Dictionary☆25Updated last week
- A modern, interlingual wordnet interface for Python☆251Updated last week
- Fast and robust date extraction from web pages, with Python or on the command-line☆131Updated 6 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆164Updated 3 weeks ago
- Central Alaskan Yup'ik FST morphological analyzer/generator☆12Updated last month
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.☆36Updated this week
- Domain-specific programming language for linguistic grammars and transducers — Langage dédié pour les grammaires linguistiques et les tra…☆14Updated last week
- This packages up data for the Open Multilingual Wordnet☆49Updated 3 weeks ago
- CLDF: Cross-Linguistic Data Formats - the specification☆57Updated last year
- Crawler for linguistic corpora☆204Updated last year
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated 2 months ago
- Lexical database for ~70k English words with morphological variables☆44Updated 3 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆102Updated last month
- Morphological Dictionaries for German Language☆29Updated 7 years ago
- Runnable morphological analysis tools from the UniMorph project☆16Updated 6 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆31Updated 9 months ago
- Python API to access glottolog/glottolog☆29Updated 2 weeks ago
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org.☆66Updated last month
- A character-wise tokenizer for morphologically rich languages☆27Updated 3 months ago
- Python package to read and write CLDF datasets☆18Updated 2 months ago
- Yet another search platform for linguistic corpora.☆26Updated 2 weeks ago
- 🙊 software for creating speech recognition models.☆159Updated last year
- ParlaMint: Comparable Parliamentary Corpora☆62Updated this week
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆124Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 4 months ago