Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and dozens of other languages that uses affix rules (prefix, infix, suffix and circumfix). The rules are obtained by supervised learning from a list of full form/lemma pairs.
☆36 · Jun 26, 2025 · Updated 8 months ago
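To make the idea of learning rules from a full form/lemma list concrete, here is a minimal Python sketch. It is not cstlemma's algorithm: it handles suffixes only (no prefix, infix or circumfix rules), and the function names `learn_suffix_rules` and `lemmatise`, as well as the toy training pairs, are hypothetical and chosen purely for illustration.

```python
from collections import Counter, defaultdict


def learn_suffix_rules(pairs):
    """Learn suffix rewrite rules from (full form, lemma) pairs.

    Illustrative simplification: each pair yields one rule that maps the
    part of the form after the longest common prefix to the matching
    part of the lemma.
    """
    counts = defaultdict(Counter)
    for form, lemma in pairs:
        # Find the length of the longest common prefix.
        i = 0
        while i < min(len(form), len(lemma)) and form[i] == lemma[i]:
            i += 1
        # Skip pairs where the form has no differing tail to rewrite.
        if i < len(form):
            counts[form[i:]][lemma[i:]] += 1
    # For each form suffix, keep the most frequently seen replacement.
    return {old: repl.most_common(1)[0][0] for old, repl in counts.items()}


def lemmatise(word, rules):
    """Apply the longest matching suffix rule; return the word unchanged otherwise."""
    for old in sorted(rules, key=len, reverse=True):
        if word.endswith(old) and len(word) > len(old):
            return word[: len(word) - len(old)] + rules[old]
    return word


# Hypothetical toy training list of (full form, lemma) pairs.
TRAINING_PAIRS = [
    ("walked", "walk"),
    ("talked", "talk"),
    ("cats", "cat"),
    ("dogs", "dog"),
    ("running", "run"),
]
RULES = learn_suffix_rules(TRAINING_PAIRS)
```

With these toy pairs, `lemmatise("jumped", RULES)` strips the learned `ed` suffix, while a word that matches no rule is returned unchanged. A real affix-rule lemmatiser would also need to weigh competing rules and handle changes at the start or in the middle of a word.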
Alternatives and similar repositories for cstlemma
Users that are interested in cstlemma are comparing it to the libraries listed below
- ACL Rolling Review website ☆11 · Feb 24, 2026 · Updated last week
- A python library for easily querying morphological inflection models trained on Unimorph ☆13 · Oct 23, 2022 · Updated 3 years ago
- Modernized version of Eric Brill's Part Of Speech tagger. ☆15 · May 6, 2025 · Updated 10 months ago
- Repository for creating models, vocabulary and other necessities for Dutch in spaCy ☆11 · Dec 15, 2016 · Updated 9 years ago
- ☆16 · Jan 20, 2022 · Updated 4 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 · Oct 13, 2018 · Updated 7 years ago
- Sentida ☆22 · Dec 14, 2021 · Updated 4 years ago
- Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database ☆26 · Jun 10, 2024 · Updated last year
- Ubiflux Vigor ventilation system RS485 Modbus communications with Python ☆11 · Feb 20, 2026 · Updated 2 weeks ago
- Morphological Dictionaries for German Language ☆30 · Apr 6, 2018 · Updated 7 years ago
- Code for morphological transformations ☆29 · Jun 3, 2017 · Updated 8 years ago
- A fast, simple, multilingual tokenizer ☆29 · May 24, 2017 · Updated 8 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database. ☆10 · Mar 19, 2019 · Updated 6 years ago
- A Python Wrapper To Retrieve Data From The CrowdTangle API ☆11 · Jun 10, 2025 · Updated 8 months ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆79 · Sep 20, 2021 · Updated 4 years ago
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://… ☆37 · Feb 10, 2026 · Updated 3 weeks ago
- Featurize words into orthographic and phonological vectors. ☆41 · May 20, 2023 · Updated 2 years ago
- MG top-down beam parsing ☆13 · Jul 2, 2018 · Updated 7 years ago
- Next word prediction based on N-gram language model ☆12 · Jan 11, 2015 · Updated 11 years ago
- Search volume for Amazon completion service ☆13 · Feb 5, 2019 · Updated 7 years ago
- Flow map visualization tool ☆11 · Mar 20, 2023 · Updated 2 years ago
- Hungarian tokenizer. ☆14 · Mar 15, 2022 · Updated 3 years ago
- PyMoves ☆54 · Oct 11, 2017 · Updated 8 years ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested … ☆10 · Dec 27, 2021 · Updated 4 years ago
- Xueersi Online School AI Open Platform SDK ☆10 · Mar 7, 2020 · Updated 6 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github… ☆14 · Dec 19, 2022 · Updated 3 years ago
- A CNN audio classifier via spectrogram images. ☆10 · Jul 21, 2017 · Updated 8 years ago
- Combining encoder-based language models ☆11 · Nov 11, 2021 · Updated 4 years ago
- Twitter Sentiment Analysis ☆10 · Jul 20, 2015 · Updated 10 years ago
- several algorithms for converting dependency structures into constituency structures. ☆10 · Feb 7, 2022 · Updated 4 years ago
- A python script to remotely control a Sonos music player with NFC tags. Part of the Song Blocks project that allows a toddler to control… ☆16 · May 10, 2016 · Updated 9 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 · Nov 5, 2020 · Updated 5 years ago
- A Python mountainous island heightmap generator, using domain warped fractaled simplex noise and the Julia Set. Yep, that's a lot of adje… ☆14 · Jan 10, 2018 · Updated 8 years ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA) ☆10 · Jun 2, 2021 · Updated 4 years ago
- Fetch and parse the American Presidency Project's press-briefing and presidential-news-conference transcripts. ☆11 · Aug 18, 2016 · Updated 9 years ago
- A wrapper, a lemmatizer and REST API implemented in Python for emMorph (Humor) Hungarian morphological analyzer ☆11 · Feb 11, 2021 · Updated 5 years ago
- Convert CoNLL output of a dependency parser into a latex or graphviz tree ☆12 · Mar 26, 2020 · Updated 5 years ago
- Grapheme-to-phoneme (G2P) conversion for Tamil / Kannada languages - a building block for Indic text-to-speech (TTS) systems ☆12 · Nov 15, 2017 · Updated 8 years ago
- A Java port of Jieba 0.39 that supports all core features of the original Jieba ☆12 · Feb 14, 2019 · Updated 7 years ago