kuhumcst/cstlemma

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kuhumcst/cstlemma)

kuhumcst / cstlemma

Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.

☆37

Alternatives and similar repositories for cstlemma

Users that are interested in cstlemma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

michmech / BuNaMo
View on GitHub
Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database
☆28Jun 10, 2024Updated 2 years ago
nlesc-sherlock / spaCy-dutch
View on GitHub
Repository for creating models, vocabulary and other necessities for Dutch in Spacey
☆11Dec 15, 2016Updated 9 years ago
cdcrabtree / nomine
View on GitHub
Classify names by gender, U.S. ethnicity, or leaf nationality
☆19Oct 13, 2018Updated 7 years ago
antonisa / unimorph_inflect
View on GitHub
A python library for easily querying morphological inflection models trained on Unimorph
☆13Oct 23, 2022Updated 3 years ago
antongrau / soc.elite
View on GitHub
The Danish Elite Network
☆21Dec 14, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
angeloskath / supervised-lda
View on GitHub
A flexible variational inference LDA library.
☆23Mar 15, 2019Updated 7 years ago
aaronrkaufman / stringmatch
View on GitHub
Implements the Adaptive Fuzzy String Matching model from Kaufman & Klevs
☆11Nov 28, 2022Updated 3 years ago
nytud / emMorph
View on GitHub
☆17Jan 20, 2022Updated 4 years ago
njtierney / mputr
View on GitHub
Package for handling multiple imputations in a tidy format
☆12Jul 10, 2019Updated 7 years ago
ishanagr / ethnicity
View on GitHub
Determines the ethnicity based on your last name
☆10Aug 17, 2014Updated 11 years ago
sorenlind / lemmy
View on GitHub
🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪
☆80Sep 20, 2021Updated 4 years ago
kmunger / Replication-Materials-for-Tweetment-Effects-on-the-Tweeted
View on GitHub
☆10Nov 2, 2016Updated 9 years ago
sebastianbarfort / sds
View on GitHub
Social Data Science, course at University of Copenhagen
☆13Jul 26, 2017Updated 9 years ago
davben / arvig
View on GitHub
An R data package containing georeferenced events of right-wing violence in Germany from 2014 onwards
☆11Jun 27, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kbenoit / CSTA-APSR
View on GitHub
Replication Materials for "Crowd-Sourced Text Analysis" APSR (2016) 110(2): 278-295.
☆11Oct 28, 2017Updated 8 years ago
brentonk / psci8357
View on GitHub
PSCI 8357: Statistics for Political Research II
☆11Apr 21, 2016Updated 10 years ago
fsolt / pewdata
View on GitHub
Reproducible Retrieval of Pew Research Center Datasets in R
☆10Apr 14, 2021Updated 5 years ago
sckott / request
View on GitHub
http requests DSL for R
☆36Jun 16, 2020Updated 6 years ago
ccgilroy / word-embeddings-workshop
View on GitHub
CSS workshop on word embeddings for the social sciences, 3/19/21
☆12Mar 19, 2021Updated 5 years ago
asphalt-framework / asphalt-serialization
View on GitHub
Serialization component for the Asphalt framework
☆11Jul 20, 2026Updated last week
WladimirSidorenko / SentiLex
View on GitHub
Sentiment Lexicon Generation Suite
☆15Dec 4, 2017Updated 8 years ago
yoshuawuyts / virtual-streamgraph
View on GitHub
Create a virtual-dom streamgraph
☆16Sep 14, 2017Updated 8 years ago
nap / jaro-winkler-distance
View on GitHub
Finds a non-euclidean distance or similarity between two strings.
☆29May 12, 2026Updated 2 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
matthewjdenny / PPOL_628_Text_As_Data
View on GitHub
PPOL 628 Spring 2020 Course Webpage
☆16Nov 11, 2020Updated 5 years ago
bendichter / tenseflow
View on GitHub
Change the tense of any text
☆41Apr 16, 2024Updated 2 years ago
themiurgo / twitterstream-downloader
View on GitHub
Twitter stream and social network crawling tools
☆17Nov 17, 2016Updated 9 years ago
hrbrmstr / elpresidente
View on GitHub
🇺🇸 Search and Extract Corpus Elements from 'The American Presidency Project'
☆20Apr 23, 2018Updated 8 years ago
ropensci-archive / extractr
View on GitHub
ARCHIVED Extract Text from 'PDFs'
☆21May 10, 2022Updated 4 years ago
reckart / tt4j
View on GitHub
TreeTagger for Java
☆18Oct 17, 2024Updated last year
bkuhlmann / pennyworth
View on GitHub
A command line interface for augmented Alfred workflows.
☆16Jul 17, 2026Updated last week
igorti / igorti.github.io
View on GitHub
☆11May 2, 2020Updated 6 years ago
simonmunzert / hitler-speeches
View on GitHub
Supplementary and replication materials for paper "Examining a Most Likely Case for Strong Campaign Effects: Hitler's Speeches and the Ri…
☆15Jun 6, 2018Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
UPB-SS1 / PyCrowdTangle
View on GitHub
A Python Wrapper To Retrieve Data From The CrowdTangle API
☆11Mar 26, 2026Updated 4 months ago
d4tagirl / John-Oliver-sentiment-analysis
View on GitHub
Code for my blog post about text mining Last Week Tonight comments
☆14Jun 10, 2017Updated 9 years ago
DuyguA / german-morph-dictionaries
View on GitHub
Morphological Dictionaries for German Language
☆32Apr 29, 2026Updated 3 months ago
justingrimmer / WUSTL
View on GitHub
Text as Data Material for WashU Course
☆15Nov 7, 2017Updated 8 years ago
christophergandrud / EIUCrisesMeasure
View on GitHub
Real-time perceptions of financial market stress measured using kernel PCA
☆15Apr 10, 2017Updated 9 years ago
alexeyev / mystem-scala
View on GitHub
Morphological analyzer `mystem` (Russian language) wrapper for JVM languages
☆26Jun 29, 2026Updated last month
jonsafari / tok-tok
View on GitHub
A fast, simple, multilingual tokenizer
☆29May 24, 2017Updated 9 years ago