snowballstem / snowball-data
Test data for snowball stemming algorithms
☆33Updated this week
Alternatives and similar repositories for snowball-data:
Users that are interested in snowball-data are comparing it to the libraries listed below
- Transliteration package for Indian scripts☆17Updated 8 years ago
- 📖 Library that provides ways to read from and iterate through the Wikibase entities in a Wikibase Repository JSON dump☆74Updated 9 months ago
- Snowball compiler and stemming algorithms☆788Updated this week
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated 3 months ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆192Updated 4 years ago
- ElixirFM Functional Arabic Morphology☆43Updated 2 years ago
- Crawler for linguistic corpora☆204Updated last year
- A Javascript Implementation of the Porter Stemmer☆96Updated 3 years ago
- The curation repository for the data behind Concepticon.☆38Updated last week
- Software and resources for natural language processing.☆131Updated 8 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆36Updated 2 years ago
- NameTag: Named Entity Tagger☆38Updated 8 months ago
- SCOWL (and friends).☆419Updated 3 weeks ago
- Lexical database of any language☆179Updated 2 years ago
- AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training☆42Updated 11 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆102Updated 9 years ago
- SymSpellCompound: compound aware automatic spelling correction☆66Updated 7 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆90Updated last year
- Libraries and command-line tools for metrical analysis of epic Greek hexameter☆27Updated 7 years ago
- A web framework to display Cross Linguistic Linked Data.☆56Updated 2 months ago
- The NLG tool for Finnish☆23Updated last year
- Open morphology for Finnish☆90Updated last week
- Language Detection with Infinity-gram☆229Updated 9 years ago
- Web service for implementing a large-scale translation memory☆90Updated 3 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆162Updated 4 years ago
- Automatic Isnad tree visualisation☆13Updated last year
- Lexical data at Unicode☆68Updated 8 months ago
- Transcripts for the audio files in the Berkeley Restaurant Project (BeRP) corpus☆23Updated 10 months ago