snowballstem / snowball
Snowball compiler and stemming algorithms
☆816 Updated last week
Alternatives and similar repositories for snowball
Users who are interested in snowball are comparing it to the libraries listed below.
- Compact Language Detector 2 ☆879 Updated 4 years ago
- Python stemming library using snowball stemmers ☆267 Updated 3 months ago
- All languages stopwords collection ☆463 Updated last year
- Machine-readable lists of lemma-token pairs in 23 languages. ☆349 Updated 3 years ago
- SCOWL (and friends). ☆450 Updated 4 months ago
- FreeLing project source code ☆260 Updated 2 years ago
- The most popular spellchecking library. ☆2,377 Updated last month
- Automatically exported from code.google.com/p/chromium-compact-language-detector ☆161 Updated 5 years ago
- The CMU Link Grammar natural language parser ☆403 Updated 3 weeks ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary. ☆197 Updated 2 years ago
- The Open English WordNet ☆659 Updated 2 weeks ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆390 Updated 3 months ago
- Lexical database of any language ☆184 Updated 3 years ago
- ☆856 Updated 2 years ago
- Universal Dependencies online documentation ☆287 Updated this week
- Test data for snowball stemming algorithms ☆35 Updated 3 weeks ago
- Modern spell checking library - accurate, fast, multi-language ☆654 Updated last year
- Automatically exported from code.google.com/p/foma ☆124 Updated 2 months ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆378 Updated 2 years ago
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University. ☆362 Updated 2 years ago
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing) ☆760 Updated 6 years ago
- The approximate regex matching library and agrep command line tool. ☆865 Updated 3 months ago
- Small strings compression library ☆1,208 Updated 6 years ago
- Carrot2: Text Clustering Algorithms and Applications ☆835 Updated 3 weeks ago
- Unitex/GramLab C++ Core ☆22 Updated last year
- ☆185 Updated 7 years ago
- Access a database of word frequencies, in various natural languages. ☆1,572 Updated 10 months ago
- Fast Word Segmentation with Triangular Matrix ☆83 Updated 4 years ago
- Stopwords for 50 languages in JSON format ☆431 Updated 2 years ago
- A simple and fast discriminative sequence labeling toolkit (http://wapiti.limsi.fr) ☆255 Updated 2 years ago