Machine-readable lists of lemma-token pairs in 23 languages.
☆363Jan 29, 2022Updated 4 years ago
Alternatives and similar repositories for lemmatization-lists
Users that are interested in lemmatization-lists are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆36Mar 30, 2024Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆206Updated this week
- 📂 Additional lookup tables and data resources for spaCy☆115Jun 4, 2025Updated last year
- Gramadán: a computational grammar of Irish☆17Jan 23, 2023Updated 3 years ago
- Repository for Frequency Word List Generator and processed files☆1,505Feb 7, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A python module for English lemmatization and inflection.☆280Sep 14, 2023Updated 2 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Hunspell dictionaries for PostgreSQL☆68Nov 25, 2019Updated 6 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Aug 21, 2025Updated 10 months ago
- Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, …☆23May 12, 2021Updated 5 years ago
- Curated list of Linguistic Resources for doing NLP & CL on Spanish☆351Jan 9, 2024Updated 2 years ago
- UD Greek☆22May 6, 2026Updated last month
- NetBSD cdb (constant database) library☆14May 24, 2019Updated 7 years ago
- German lemmatization with IWNLP as extension for spaCy☆27Apr 13, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Shared ispell dictionary (stored in shared segment, used by multiple connections)☆12May 19, 2026Updated last month
- A project to collect all tamil nouns☆12Dec 14, 2024Updated last year
- Wiktionary dump file parser and multilingual data extractor☆1,197Jun 23, 2026Updated last week
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated 5 months ago
- spaCy-to-naf converter☆21Jun 10, 2025Updated last year
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆112Jun 24, 2026Updated last week
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆34Jul 5, 2019Updated 6 years ago
- ☆14Mar 30, 2023Updated 3 years ago
- Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefi…☆37Updated this week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 8 months ago
- A library for fetching and reading Tatoeba's weekly exports☆24Feb 5, 2026Updated 4 months ago
- BaseTerm is an open-source and free to use terminology management system built with the primary goal of natively supporting the most popu…☆11Sep 16, 2017Updated 8 years ago
- Hy-phen-ation made easy☆229Jun 19, 2026Updated last week
- ☆16Sep 13, 2016Updated 9 years ago
- MorphoDiTa: Morphologic Dictionary and Tagger☆82Jan 28, 2026Updated 5 months ago
- Autojump for Total Commander !!☆13Nov 25, 2020Updated 5 years ago
- [EMNLP 2020] Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆398Nov 7, 2023Updated 2 years ago
- 🎀 JavaScript API for spaCy with Python REST API☆201Sep 16, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- linux_logo in 26+ kinds of assembly language☆17Jul 17, 2023Updated 2 years ago
- Code for morphological transformations☆29Jun 3, 2017Updated 9 years ago
- This demo showcase the use of onnxruntime-rs with a GPU on CUDA 11 to run Bert in a data pipeline with Rust.☆16Feb 7, 2022Updated 4 years ago
- linguistics backend☆42Mar 25, 2023Updated 3 years ago
- DSL and Gem for defining an AWS architecture☆33May 8, 2023Updated 3 years ago
- CouchDB conflict resolution sample code☆17Oct 20, 2017Updated 8 years ago
- Access a database of word frequencies, in various natural languages.☆1,681Jan 4, 2025Updated last year