Machine-readable lists of lemma-token pairs in 23 languages.
☆362Jan 29, 2022Updated 4 years ago
Alternatives and similar repositories for lemmatization-lists
Users that are interested in lemmatization-lists are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆203Jun 1, 2026Updated last week
- 📂 Additional lookup tables and data resources for spaCy☆115Jun 4, 2025Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆20Updated this week
- Gramadán: a computational grammar of Irish☆17Jan 23, 2023Updated 3 years ago
- Repository for Frequency Word List Generator and processed files☆1,499Feb 7, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A python module for English lemmatization and inflection.☆280Sep 14, 2023Updated 2 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Aug 21, 2025Updated 9 months ago
- Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, …☆23May 12, 2021Updated 5 years ago
- Rust bindings for the spaCy library.☆24Dec 11, 2022Updated 3 years ago
- Curated list of Linguistic Resources for doing NLP & CL on Spanish☆350Jan 9, 2024Updated 2 years ago
- UD Greek☆22May 6, 2026Updated last month
- django-mdict是django实现的mdict词典查询工具。☆56Oct 21, 2024Updated last year
- German lemmatization with IWNLP as extension for spaCy☆27Apr 13, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- a service to read mdx/mdd file and provide http interface☆261Jul 2, 2021Updated 4 years ago
- Wiktionary dump file parser and multilingual data extractor☆1,176Jun 4, 2026Updated last week
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated 4 months ago
- generate a html or pdf or jpg file for specific words through a mdx dirctionary☆42Dec 11, 2023Updated 2 years ago
- Tools and Data for the CMU Pronouncing Dictionary☆16Dec 9, 2018Updated 7 years ago
- Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database☆27Jun 10, 2024Updated 2 years ago
- Preliminary spaCy models for Latin☆14Oct 20, 2022Updated 3 years ago
- spaCy-to-naf converter☆21Jun 10, 2025Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆112May 27, 2026Updated 2 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Mar 30, 2023Updated 3 years ago
- Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefi…☆37Jun 26, 2025Updated 11 months ago
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 7 months ago
- A library for fetching and reading Tatoeba's weekly exports☆24Feb 5, 2026Updated 4 months ago
- Hy-phen-ation made easy☆228Jan 5, 2026Updated 5 months ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Dec 17, 2021Updated 4 years ago
- Autojump for Total Commander !!☆13Nov 25, 2020Updated 5 years ago
- Access to lexical databases☆155Feb 11, 2026Updated 4 months ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🎀 JavaScript API for spaCy with Python REST API☆201Sep 16, 2023Updated 2 years ago
- Code for morphological transformations☆29Jun 3, 2017Updated 9 years ago
- linguistics backend☆42Mar 25, 2023Updated 3 years ago
- words frequency top100k from BNC/ANC/COCA, dsl format, for goldendict☆65Dec 17, 2016Updated 9 years ago
- Access a database of word frequencies, in various natural languages.☆1,669Jan 4, 2025Updated last year
- Detect and classify pagination links☆15Sep 9, 2020Updated 5 years ago
- A script for converting DSL format dictionaries compatible with GoldenDict to the Migaku Dictionary format.☆15May 9, 2025Updated last year