Machine-readable lists of lemma-token pairs in 23 languages.
☆361 · Jan 29, 2022 · Updated 4 years ago
Alternatives and similar repositories for lemmatization-lists
Users that are interested in lemmatization-lists are comparing it to the libraries listed below.
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆199 · Jun 6, 2025 · Updated 11 months ago
- 📂 Additional lookup tables and data resources for spaCy ☆115 · Jun 4, 2025 · Updated 11 months ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups ☆20 · May 13, 2026 · Updated last week
- Morphological Dictionaries for German Language ☆32 · Apr 29, 2026 · Updated 3 weeks ago
- Gramadán: a computational grammar of Irish ☆17 · Jan 23, 2023 · Updated 3 years ago
- Repository for Frequency Word List Generator and processed files ☆1,488 · Feb 7, 2022 · Updated 4 years ago
- A Python module for English lemmatization and inflection. ☆280 · Sep 14, 2023 · Updated 2 years ago
- ☆14 · Mar 30, 2026 · Updated last month
- spacy-wordnet creates annotations that make it easy to use WordNet and WordNet Domains through the NLTK WordNet interface ☆261 · Aug 21, 2025 · Updated 9 months ago
- Dice.com's relevancy feedback Solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT-style recommendations, … ☆23 · May 12, 2021 · Updated 5 years ago
- Curated list of Linguistic Resources for doing NLP & CL on Spanish ☆349 · Jan 9, 2024 · Updated 2 years ago
- UD Greek ☆22 · May 6, 2026 · Updated 2 weeks ago
- django-mdict is an mdict dictionary lookup tool implemented in Django. ☆58 · Oct 21, 2024 · Updated last year
- German lemmatization with IWNLP as extension for spaCy ☆27 · Apr 13, 2026 · Updated last month
- Wiktionary dump file parser and multilingual data extractor ☆1,159 · Updated this week
- Generate an HTML, PDF, or JPG file for specific words from an MDX dictionary ☆42 · Dec 11, 2023 · Updated 2 years ago
- Preliminary spaCy models for Latin ☆14 · Oct 20, 2022 · Updated 3 years ago
- Python package for Wikimedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆112 · Updated this week
- ☆14 · Mar 30, 2023 · Updated 3 years ago
- Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefi… ☆37 · Jun 26, 2025 · Updated 10 months ago
- A library for fetching and reading Tatoeba's weekly exports ☆24 · Feb 5, 2026 · Updated 3 months ago
- Emoji Favicon Toolkit - Set your favicon to emoji using canvas & cache as /favicon.ico with service workers ☆16 · Mar 16, 2019 · Updated 7 years ago
- Hy-phen-ation made easy ☆225 · Jan 5, 2026 · Updated 4 months ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 · Dec 17, 2021 · Updated 4 years ago
- ☆16 · Sep 13, 2016 · Updated 9 years ago
- Autojump for Total Commander!! ☆13 · Nov 25, 2020 · Updated 5 years ago
- Access to lexical databases ☆154 · Feb 11, 2026 · Updated 3 months ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆395 · Nov 7, 2023 · Updated 2 years ago
- ☆10 · Aug 23, 2023 · Updated 2 years ago
- TODO less, DO more. Keep your code clean without changing the way you code. ☆37 · Dec 8, 2019 · Updated 6 years ago
- 🎀 JavaScript API for spaCy with Python REST API ☆201 · Sep 16, 2023 · Updated 2 years ago
- Code for morphological transformations ☆29 · Jun 3, 2017 · Updated 8 years ago
- This demo showcases the use of onnxruntime-rs with a GPU on CUDA 11 to run BERT in a data pipeline with Rust. ☆16 · Feb 7, 2022 · Updated 4 years ago
- Linguistics backend ☆42 · Mar 25, 2023 · Updated 3 years ago
- This plugin provides a useful feature for multi-language ☆14 · Jul 15, 2022 · Updated 3 years ago
- Access a database of word frequencies, in various natural languages. ☆1,661 · Jan 4, 2025 · Updated last year
- ☆13 · Oct 13, 2012 · Updated 13 years ago
- ☆18 · Dec 10, 2024 · Updated last year
- Benchmarking Elasticsearch vs. OpenSearch ☆24 · Sep 15, 2025 · Updated 8 months ago