snowballstem / snowball-dataLinks
Test data for snowball stemming algorithms
☆35Updated 3 weeks ago
Alternatives and similar repositories for snowball-data
Users that are interested in snowball-data are comparing it to the libraries listed below
Sorting:
- Snowball compiler and stemming algorithms☆816Updated last week
- 📖 Library that provides ways to read from and iterate through the Wikibase entities in a Wikibase Repository JSON dump☆72Updated last year
- SCOWL (and friends).☆450Updated 4 months ago
- A fast and accurate POS and morphological tagging toolkit (EACL 2014)☆149Updated 5 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆161Updated 5 years ago
- Transliteration package for Indian scripts☆16Updated 8 years ago
- ElixirFM Functional Arabic Morphology☆44Updated 2 years ago
- Python stemming library using snowball stemmers☆267Updated 3 months ago
- Latin text dataset for machine learning and procedural text generation☆19Updated last year
- Morphosyntactic tagger for Norwegian bokmål and nynorsk☆29Updated 2 years ago
- A Javascript Implementation of the Porter Stemmer☆96Updated 4 years ago
- List of common stop words in various languages.☆339Updated 3 weeks ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆76Updated 5 months ago
- All languages stopwords collection☆463Updated last year
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆197Updated 5 years ago
- Assem's Arabic Light Stemmer is a snowball-based stemming algorithm for Arabic aimed mainly to improve search.☆149Updated 3 years ago
- Simple Python Wrapper around MediaWiki API☆30Updated 3 years ago
- The CMU Link Grammar natural language parser☆403Updated 3 weeks ago
- Get list of common stop words in various languages in Python☆157Updated 2 weeks ago
- Software and resources for natural language processing.☆131Updated 9 years ago
- French language support for TextBlob.☆59Updated 5 years ago
- Linguistica 5: Unsupervised Learning of Linguistic Structure☆32Updated 6 years ago
- FreeLing project source code☆260Updated 2 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Open morphology for Finnish☆94Updated 2 months ago
- Lexical database of any language☆184Updated 3 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆390Updated 3 months ago
- Web service for implementing a large-scale translation memory☆90Updated 4 years ago
- Machine translation for the real world☆23Updated 5 years ago
- IXA pipes Named Entity Tagger (http://ixa2.si.ehu.es/ixa-pipes).☆33Updated 6 years ago