snowballstem / snowballLinks
Snowball compiler and stemming algorithms
☆804Updated this week
Alternatives and similar repositories for snowball
Users that are interested in snowball are comparing it to the libraries listed below
Sorting:
- Compact Language Detector 2☆870Updated 4 years ago
- FreeLing project source code☆258Updated 2 years ago
- Machine-readable lists of lemma-token pairs in 23 languages.☆342Updated 3 years ago
- ☆842Updated 2 years ago
- All languages stopwords collection☆451Updated last year
- The Open English WordNet☆598Updated last month
- English stopwords collection☆163Updated 8 years ago
- Lexical database of any language☆183Updated 3 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆162Updated 4 years ago
- The CMU Link Grammar natural language parser☆399Updated 4 months ago
- Universal Dependencies online documentation☆288Updated last week
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆386Updated 2 weeks ago
- Python stemming library using snowball stemmers☆263Updated 2 months ago
- List of common stop words in various languages.☆337Updated 2 years ago
- It's just a simple regex benchmark of different programming languages.☆323Updated last year
- The approximate regex matching library and agrep command line tool.☆847Updated 2 weeks ago
- SCOWL (and friends).☆437Updated 3 weeks ago
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)☆755Updated 6 years ago
- Test data for snowball stemming algorithms☆34Updated last month
- PISA: Performant Indexes and Search for Academia☆1,007Updated last month
- Automatically exported from code.google.com/p/universal-pos-tags☆129Updated 3 years ago
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University.☆353Updated 2 years ago
- Terrier IR Platform☆264Updated 3 weeks ago
- A simple and fast discriminative sequence labeling toolkit ( http://wapiti.limsi.fr )☆254Updated 2 years ago
- The most popular spellchecking library.☆2,304Updated 2 weeks ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆376Updated 2 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆197Updated 4 years ago
- Language Detection with Infinity-gram☆230Updated 10 years ago
- Stopwords for 50 languages in JSON format☆432Updated 2 years ago
- ☆184Updated 6 years ago