Machine-readable lists of lemma-token pairs in 23 languages.
☆362Jan 29, 2022Updated 4 years ago
Alternatives and similar repositories for lemmatization-lists
Users that are interested in lemmatization-lists are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆193Jun 6, 2025Updated 10 months ago
- 📂 Additional lookup tables and data resources for spaCy☆115Jun 4, 2025Updated 10 months ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆20Apr 24, 2026Updated last week
- Morphological Dictionaries for German Language☆32Updated this week
- Gramadán: a computational grammar of Irish☆17Jan 23, 2023Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Repository for Frequency Word List Generator and processed files☆1,479Feb 7, 2022Updated 4 years ago
- A python module for English lemmatization and inflection.☆278Sep 14, 2023Updated 2 years ago
- Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy☆112Oct 14, 2021Updated 4 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- ☆14Mar 30, 2026Updated last month
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization".☆24Apr 22, 2020Updated 6 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Aug 21, 2025Updated 8 months ago
- 📦 English word lemmatizer☆17May 3, 2022Updated 3 years ago
- Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, …☆23May 12, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Rust bindings for the spaCy library.☆24Dec 11, 2022Updated 3 years ago
- Curated list of Linguistic Resources for doing NLP & CL on Spanish☆349Jan 9, 2024Updated 2 years ago
- django-mdict是django实现的mdict词典查询工具。☆56Oct 21, 2024Updated last year
- Auto tagging with OpenNPL☆16Nov 20, 2013Updated 12 years ago
- German lemmatization with IWNLP as extension for spaCy☆27Apr 13, 2026Updated 2 weeks ago
- Shared ispell dictionary (stored in shared segment, used by multiple connections)☆12Mar 24, 2026Updated last month
- WordNet behind a REST interface☆13Apr 9, 2025Updated last year
- A project to collect all tamil nouns☆12Dec 14, 2024Updated last year
- Wiktionary dump file parser and multilingual data extractor☆1,149Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated 3 months ago
- Tools and Data for the CMU Pronouncing Dictionary☆16Dec 9, 2018Updated 7 years ago
- Preliminary spaCy models for Latin☆14Oct 20, 2022Updated 3 years ago
- spaCy-to-naf converter☆21Jun 10, 2025Updated 10 months ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Jul 5, 2019Updated 6 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆109Apr 13, 2026Updated 2 weeks ago
- Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefi…☆37Jun 26, 2025Updated 10 months ago
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 6 months ago
- A library for fetching and reading Tatoeba's weekly exports☆24Feb 5, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Hy-phen-ation made easy☆223Jan 5, 2026Updated 3 months ago
- ☆16Sep 13, 2016Updated 9 years ago
- Example Swift 4.2 implementation of the Sidebar Use Case using NSOutlineView with CoreData, RxSwift and the whole layout aligned at Apple…☆12Jun 21, 2020Updated 5 years ago
- 🎀 JavaScript API for spaCy with Python REST API☆201Sep 16, 2023Updated 2 years ago
- linguistics backend☆42Mar 25, 2023Updated 3 years ago
- DSL and Gem for defining an AWS architecture☆33May 8, 2023Updated 2 years ago
- This plugin provides a useful feature for multi-language☆14Jul 15, 2022Updated 3 years ago