Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages that uses affix rules (an affix can be a prefix, infix, suffix or circumfix). The rules are obtained by supervised learning from a list of full forms paired with their lemmas.
☆37 · Jun 26, 2025 · Updated 11 months ago
Alternatives and similar repositories for cstlemma
Users who are interested in cstlemma are comparing it to the libraries listed below.
- Modernized version of Eric Brill's Part Of Speech tagger. ☆15 · May 6, 2025 · Updated last year
- Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database ☆27 · Jun 10, 2024 · Updated 2 years ago
- ACL Rolling Review website ☆11 · Jun 11, 2026 · Updated last week
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 · Oct 13, 2018 · Updated 7 years ago
- A python library for easily querying morphological inflection models trained on Unimorph ☆13 · Oct 23, 2022 · Updated 3 years ago
- The Danish Elite Network ☆21 · Dec 14, 2018 · Updated 7 years ago
- Twitter stream and social network crawling tools ☆17 · Nov 17, 2016 · Updated 9 years ago
- ☆17 · Jan 20, 2022 · Updated 4 years ago
- Determines the ethnicity based on your last name ☆10 · Aug 17, 2014 · Updated 11 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆80 · Sep 20, 2021 · Updated 4 years ago
- An R data package containing georeferenced events of right-wing violence in Germany from 2014 onwards ☆11 · Jun 27, 2018 · Updated 7 years ago
- Replication Materials for "Crowd-Sourced Text Analysis" APSR (2016) 110(2): 278-295. ☆11 · Oct 28, 2017 · Updated 8 years ago
- Crowd sourcing neighborhood boundaries, stories, and descriptions. Making pretty maps. ☆13 · Oct 6, 2015 · Updated 10 years ago
- PSCI 8357: Statistics for Political Research II ☆11 · Apr 21, 2016 · Updated 10 years ago
- Fan plots for plotting distributions in ggplot2 ☆38 · Sep 15, 2023 · Updated 2 years ago
- CSS workshop on word embeddings for the social sciences, 3/19/21 ☆12 · Mar 19, 2021 · Updated 5 years ago
- Reproducible Retrieval of Pew Research Center Datasets in R ☆10 · Apr 14, 2021 · Updated 5 years ago
- Simple Lexer and Parser in F# ☆21 · Sep 4, 2020 · Updated 5 years ago
- ☆10 · Nov 2, 2016 · Updated 9 years ago
- Give the make script a ttf file and tex file, and you'll get a pdf of the tex using that font. ☆11 · Jul 8, 2012 · Updated 13 years ago
- ☆13 · Jan 6, 2018 · Updated 8 years ago
- PyMoves ☆54 · Oct 11, 2017 · Updated 8 years ago
- ☆18 · Jul 1, 2022 · Updated 3 years ago
- Change the tense of any text ☆41 · Apr 16, 2024 · Updated 2 years ago
- A Brief Introduction to Text Analysis Using R ☆15 · Oct 27, 2016 · Updated 9 years ago
- 🇺🇸 Search and Extract Corpus Elements from 'The American Presidency Project' ☆20 · Apr 23, 2018 · Updated 8 years ago
- ARCHIVED Extract Text from 'PDFs' ☆21 · May 10, 2022 · Updated 4 years ago
- Create a virtual-dom streamgraph ☆16 · Sep 14, 2017 · Updated 8 years ago
- ☆11 · May 2, 2020 · Updated 6 years ago
- Code for my blog post about text mining Last Week Tonight comments ☆14 · Jun 10, 2017 · Updated 9 years ago
- A Python Wrapper To Retrieve Data From The CrowdTangle API ☆11 · Mar 26, 2026 · Updated 2 months ago
- Morphological Dictionaries for German Language ☆32 · Apr 29, 2026 · Updated last month
- A fast, simple, multilingual tokenizer ☆29 · May 24, 2017 · Updated 9 years ago
- Text as Data Material for WashU Course ☆15 · Nov 7, 2017 · Updated 8 years ago
- Real-time perceptions of financial market stress measured using kernel PCA ☆15 · Apr 10, 2017 · Updated 9 years ago
- A MessagePack-based storage extension to tinydb using the http://msgpack.org ☆12 · Dec 23, 2017 · Updated 8 years ago
- A Java port of Jieba 0.39 that supports all core features of the original Jieba ☆12 · Feb 14, 2019 · Updated 7 years ago
- A handy CLI for executing code in a REPL in a separate pane ☆20 · Oct 18, 2025 · Updated 8 months ago
- python metric functions, such as MAP, NDCG, AUC... ☆10 · Jul 25, 2014 · Updated 11 years ago