hugomailhot / MorphoLex-en
Lexical database for ~70k English words with morphological variables
☆42Updated 3 years ago
Alternatives and similar repositories for MorphoLex-en:
Users that are interested in MorphoLex-en are comparing it to the libraries listed below
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆40Updated last year
- Efficient Low-Memory Aligner☆142Updated last month
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆24Updated 9 months ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆30Updated 2 months ago
- A simple toolkit for conducting analyses using corpus methods☆25Updated 3 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆65Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆79Updated last year
- Repository for the Georgetown University Multilayer Corpus (GUM)☆92Updated this week
- Various utilities for processing the data.☆208Updated this week
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…☆22Updated last month
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆25Updated last year
- A Python package for learning, evaluating, annotating, and extracting vector representations of construction grammars☆35Updated 4 months ago
- A multilingual parallel corpus created from translations of the Bible.☆177Updated 5 months ago
- Python framework for processing Universal Dependencies data☆55Updated 3 weeks ago
- Python Multilingual Ucrel Semantic Analysis System☆31Updated 6 months ago
- Sentence aligner☆110Updated 3 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- ☆63Updated 9 months ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆66Updated 2 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- https://sites.google.com/site/multidimensionaltagger☆31Updated last year
- English data☆205Updated this week
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 9 months ago
- A character-wise tokenizer for morphologically rich languages☆27Updated 2 months ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago
- ☆11Updated last year
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- Compiled tools, datasets, and other resources for historical text normalization.☆18Updated 5 years ago
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆47Updated last year
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year