michmech / lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.
☆335 · Updated 2 years ago
Alternatives and similar repositories for lemmatization-lists:
Users who are interested in lemmatization-lists are comparing it to the libraries listed below.
- A modern, interlingual wordnet interface for Python ☆229 · Updated last month
- 📂 Additional lookup tables and data resources for spaCy ☆99 · Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus ☆30 · Updated 3 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆151 · Updated last month
- A tokenizer and sentence splitter for German and English web and social media texts. ☆137 · Updated last month
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆372 · Updated last month
- A Python module for English lemmatization and inflection. ☆265 · Updated last year
- Text to sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆234 · Updated 2 years ago
- All languages stopwords collection ☆426 · Updated last year
- Sentence aligner ☆109 · Updated 3 years ago
- Universal Dependencies online documentation ☆278 · Updated this week
- A fast and accurate POS and morphological tagging toolkit (EACL 2014) ☆141 · Updated 4 years ago
- A multilingual parallel corpus created from translations of the Bible. ☆177 · Updated 3 months ago
- The Open English WordNet ☆493 · Updated this week
- spaCy + UDPipe ☆161 · Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles. ☆207 · Updated 10 months ago
- German Morphological Analyzer ☆47 · Updated 3 years ago
- WordNet in JSON format. ☆91 · Updated 4 years ago
- A character-wise tokenizer for morphologically rich languages ☆27 · Updated last month
- A Corpus Data Retrieval Index using Lucene for Look-Ups ☆17 · Updated this week
- spacy-wordnet creates annotations that make it easy to use WordNet and WordNet Domains through the NLTK WordNet interface ☆253 · Updated 4 months ago
- Freeling wrapper ☆12 · Updated 8 years ago
- Modern spell checking library - accurate, fast, multi-language ☆621 · Updated 4 months ago
- Wiktionary parser tool for many language editions. ☆53 · Updated 2 years ago
- Open German WordNet ☆89 · Updated 11 months ago
- Morphological Dictionaries for German Language ☆28 · Updated 6 years ago
- List of common stop words in various languages. ☆331 · Updated 2 years ago
- LanguageTool-style grammar handling with spaCy 2.0 ☆42 · Updated 6 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested Python dictionary. ☆311 · Updated this week
- A lemmatizer for German-language text ☆87 · Updated last year