snowballstem / snowball-dataLinks
Test data for snowball stemming algorithms
☆33Updated last month
Alternatives and similar repositories for snowball-data
Users that are interested in snowball-data are comparing it to the libraries listed below
Sorting:
- Website source for snowballstem.org☆17Updated last week
- ElixirFM Functional Arabic Morphology☆43Updated 2 years ago
- A fast and accurate POS and morphological tagging toolkit (EACL 2014)☆141Updated 5 years ago
- Snowball compiler and stemming algorithms☆792Updated this week
- 📖 Library that provides ways to read from and iterate through the Wikibase entities in a Wikibase Repository JSON dump☆74Updated 10 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated 3 months ago
- Mishtar: Named and temporal entities chunker☆13Updated 4 years ago
- Web service for implementing a large-scale translation memory☆90Updated 3 years ago
- Automatic Isnad tree visualisation☆13Updated 2 years ago
- French language support for TextBlob.☆59Updated 4 years ago
- Arabic prosody (Arud) or "Science of Poetry"☆42Updated 2 years ago
- Collection of various Arabic NLP and Text Processing Scripts and Utilities☆57Updated 11 years ago
- ☆35Updated 6 years ago
- Arabic roots list resource☆10Updated 6 years ago
- Machine translation for the real world☆23Updated 5 years ago
- AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training☆42Updated 11 years ago
- A Javascript Implementation of the Porter Stemmer☆96Updated 3 years ago
- A ruby gem that contains Natural Language Processing tools for Arabic.☆11Updated 10 years ago
- The NLG tool for Finnish☆23Updated last year
- A trend viewer written in Python/JavaScript☆21Updated 6 months ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Official releases of the TOROT treebank☆9Updated 5 years ago
- A command line version of Koja Stemmer (An Arabic rooting algorithm)☆19Updated 8 years ago
- Bilingual sentence aligner (Gale & Church, 1993)☆14Updated 6 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆193Updated 4 years ago
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t…☆34Updated 8 years ago
- Fast corpus search engine originally made for the Corpus of Written Tatar language☆16Updated 5 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated 3 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages