snowballstem / snowball
Snowball compiler and stemming algorithms
☆749 Updated last week
Related projects:
- Compact Language Detector 2 ☆836 Updated 3 years ago
- Python stemming library using snowball stemmers ☆242 Updated 2 weeks ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector ☆160 Updated 3 years ago
- enchant spellchecking library ☆340 Updated 2 weeks ago
- ☆774 Updated last year
- Multilingual text (NLP) processing toolkit ☆2,307 Updated 10 months ago
- SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm ☆3,121 Updated 5 months ago
- Modern spell checking library - accurate, fast, multi-language ☆605 Updated 3 weeks ago
- DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text. ☆755 Updated 6 years ago
- All languages stopwords collection ☆420 Updated 8 months ago
- Machine-readable lists of lemma-token pairs in 23 languages. ☆323 Updated 2 years ago
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing) ☆727 Updated 5 years ago
- Lexical database of any language ☆174 Updated 2 years ago
- The most popular spellchecking library. ☆2,098 Updated last month
- Stopwords for 50 languages in JSON format ☆423 Updated last year
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆791 Updated 2 weeks ago
- Test data for snowball stemming algorithms ☆29 Updated 2 weeks ago
- Apache OpenNLP ☆1,425 Updated last week
- The Open English WordNet ☆459 Updated last week
- 🦆 Contextually-keyed word vectors ☆1,617 Updated 6 months ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies. ☆743 Updated 2 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆358 Updated last week
- Default English stopword lists from many different sources ☆288 Updated last year
- A simple and fast discriminative sequence labeling toolkit (http://wapiti.limsi.fr) ☆251 Updated last year
- MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, informat… ☆980 Updated 6 months ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆364 Updated last year
- Language Detection with Infinity-gram ☆230 Updated 9 years ago
- Heuristic based boilerplate removal tool ☆717 Updated 4 months ago
- Access a database of word frequencies, in various natural languages. ☆699 Updated 2 months ago
- spellchecking library for python ☆597 Updated 3 months ago