snowballstem / snowball-dataLinks
Test data for snowball stemming algorithms
☆38Updated last month
Alternatives and similar repositories for snowball-data
Users that are interested in snowball-data are comparing it to the libraries listed below
Sorting:
- Snowball compiler and stemming algorithms☆834Updated last month
- A fast and accurate POS and morphological tagging toolkit (EACL 2014)☆149Updated 5 years ago
- SCOWL (and friends).☆464Updated last week
- Transliteration package for Indian scripts☆16Updated 9 years ago
- Website source for snowballstem.org☆20Updated 3 weeks ago
- Lexical database of any language☆187Updated 3 years ago
- Compact Language Detector 2☆890Updated 4 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆161Updated 5 years ago
- Python stemming library using snowball stemmers☆275Updated last month
- Linguistica 5: Unsupervised Learning of Linguistic Structure☆32Updated 6 years ago
- The CMU Link Grammar natural language parser☆407Updated 3 months ago
- All languages stopwords collection☆476Updated 2 years ago
- Hunspell-based analysis for Elasticsearch☆85Updated last year
- A cloud-based, open-source system for writing and publishing dictionaries.☆99Updated 2 years ago
- English stopwords collection☆169Updated 9 years ago
- Miscellaneous materials for teaching NLP using NLTK☆36Updated 8 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆76Updated 8 months ago
- thesaurus-manager app with graph database☆31Updated 9 years ago
- ISO Language Codes (639-1 and 639-2)☆105Updated last year
- Fast corpus search engine originally made for the Corpus of Written Tatar language☆17Updated 6 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70Updated last month
- List of common stop words in various languages.☆345Updated 3 months ago
- Universal Dependencies online documentation☆288Updated this week
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram…☆254Updated 5 years ago
- Automatically exported from code.google.com/p/guess-language☆54Updated 3 months ago
- FreeLing project source code☆260Updated 2 years ago
- NLTK Book☆421Updated 3 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35Updated 2 years ago
- Automatically exported from code.google.com/p/foma☆128Updated 5 months ago
- The curation repository for the data behind Concepticon.☆42Updated this week