snowballstem / snowball-data
Test data for snowball stemming algorithms
☆32Updated this week
Alternatives and similar repositories for snowball-data:
Users that are interested in snowball-data are comparing it to the libraries listed below
- Website source for snowballstem.org☆17Updated last week
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated last month
- Snowball compiler and stemming algorithms☆775Updated last week
- SymSpellCompound: compound aware automatic spelling correction☆66Updated 7 years ago
- AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training☆42Updated 11 years ago
- The NLG tool for Finnish☆22Updated last year
- A fast and accurate POS and morphological tagging toolkit (EACL 2014)☆141Updated 5 years ago
- ElixirFM Functional Arabic Morphology☆43Updated 2 years ago
- Transliteration package for Indian scripts☆16Updated 8 years ago
- Machine translation for the real world☆23Updated 5 years ago
- enchant spellchecking library☆360Updated 2 months ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 9 months ago
- Software and resources for natural language processing.☆131Updated 8 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Hierarchical phrase-based machine translation system☆32Updated 10 years ago
- ☆10Updated 9 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆102Updated 9 years ago
- Various utilities for processing the data.☆208Updated this week
- A Javascript Implementation of the Porter Stemmer☆96Updated 3 years ago
- A Java UIMA-based toolbox for multilingual and efficient terminology extraction an multilingual term alignment☆38Updated 7 years ago
- 📖 Library that provides ways to read from and iterate through the Wikibase entities in a Wikibase Repository JSON dump☆74Updated 8 months ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆191Updated 4 years ago
- Web service for implementing a large-scale translation memory☆91Updated 3 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆112Updated 2 months ago
- Treex NLP framework☆32Updated this week
- Official releases of the PROIEL treebank of ancient Indo-European languages☆37Updated last year
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated 10 months ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated 3 years ago
- Python framework for processing Universal Dependencies data☆55Updated this week