michmech / lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.
☆358 · Jan 29, 2022 · Updated 4 years ago
Alternatives and similar repositories for lemmatization-lists
Users who are interested in lemmatization-lists are comparing it to the repositories listed below
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆185 · Jun 6, 2025 · Updated 8 months ago
- 📂 Additional lookup tables and data resources for spaCy ☆113 · Jun 4, 2025 · Updated 8 months ago
- A Python module for English lemmatization and inflection. ☆273 · Sep 14, 2023 · Updated 2 years ago
- Repository for Frequency Word List Generator and processed files ☆1,442 · Feb 7, 2022 · Updated 4 years ago
- ☆16 · Sep 13, 2016 · Updated 9 years ago
- Breaks a word into syllables using an LSTM-based neural network. ☆20 · Aug 14, 2023 · Updated 2 years ago
- django-mdict is an mdict dictionary lookup tool implemented with Django. ☆56 · Oct 21, 2024 · Updated last year
- Fast Python Vowpal Wabbit wrapper ☆13 · Mar 31, 2021 · Updated 4 years ago
- Modularized version of the Pink Trombone voice synthesizer ☆12 · May 5, 2019 · Updated 6 years ago
- DELPH-IN Documentation ☆29 · Feb 1, 2026 · Updated last week
- This repository contains the Potsdam Textbook Corpus (PoTeC), a natural reading eye-tracking corpus. ☆14 · Dec 31, 2025 · Updated last month
- Java implementation of LemmaGen project ☆11 · Feb 15, 2022 · Updated 3 years ago
- This plugin provides a useful feature for multi-language ☆14 · Jul 15, 2022 · Updated 3 years ago
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText ☆10 · Sep 3, 2019 · Updated 6 years ago
- SCTE-35 Inserter for MPEGTS. SuperKabuki is SCTE-35 Packet Injection for Ad Insertion, powered by threefive. ☆12 · Sep 13, 2024 · Updated last year
- A script for converting DSL format dictionaries compatible with GoldenDict to the Migaku Dictionary format. ☆15 · May 9, 2025 · Updated 9 months ago
- ☆13 · Mar 30, 2023 · Updated 2 years ago
- Asent is a Python library for performing efficient and transparent sentiment analysis using spaCy. ☆120 · Oct 20, 2025 · Updated 3 months ago
- A GUI to help visually tweak Solr edismax ☆19 · Apr 8, 2015 · Updated 10 years ago
- Solr SearchComponent for altering and re-executing queries that produce poor results ☆14 · May 12, 2021 · Updated 4 years ago
- SNoRe: Scalable Unsupervised Learning of Symbolic Node Representations ☆11 · Sep 26, 2023 · Updated 2 years ago
- A Python toolkit for converting pronunciations in the enwiktionary XML dump to cmudict format ☆33 · Jul 5, 2019 · Updated 6 years ago
- About 6,500 Irish lemmas ordered by corpus frequency, with noise removed. ☆37 · May 11, 2018 · Updated 7 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆58 · Jul 1, 2021 · Updated 4 years ago
- Code for morphological transformations ☆29 · Jun 3, 2017 · Updated 8 years ago
- Tools and Data for the CMU Pronouncing Dictionary ☆16 · Dec 9, 2018 · Updated 7 years ago
- Palace app for Android ☆11 · Updated this week
- Preliminary spaCy models for Latin ☆14 · Oct 20, 2022 · Updated 3 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆62 · Apr 25, 2015 · Updated 10 years ago
- Curated list of Linguistic Resources for doing NLP & CL on Spanish ☆348 · Jan 9, 2024 · Updated 2 years ago
- Access a database of word frequencies, in various natural languages. ☆1,617 · Jan 4, 2025 · Updated last year
- ☆15 · Jun 16, 2020 · Updated 5 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages ☆15 · Apr 11, 2020 · Updated 5 years ago
- The core NLP library for automatic question generation ☆17 · Mar 7, 2017 · Updated 8 years ago
- This project converts Lingoes LD2 files into the StarDict format. ☆16 · Jan 24, 2017 · Updated 9 years ago
- X (weighted / probabilistic) Context-Free Grammars ☆25 · Jan 30, 2024 · Updated 2 years ago
- A simple composable rule engine, built in an object-oriented way, to reduce manual work. ☆19 · Jun 9, 2023 · Updated 2 years ago
- PYthon Automated Term Extraction ☆318 · Feb 8, 2023 · Updated 3 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 · Dec 17, 2021 · Updated 4 years ago