snowballstem / snowball-data
Test data for snowball stemming algorithms
☆34 · Updated last month
Alternatives and similar repositories for snowball-data
Users who are interested in snowball-data compare it to the libraries listed below.
- Transliteration package for Indian scripts ☆16 · Updated 8 years ago
- Library that provides ways to read from and iterate through the Wikibase entities in a Wikibase Repository JSON dump ☆74 · Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆69 · Updated 3 weeks ago
- A fast and accurate POS and morphological tagging toolkit (EACL 2014) ☆141 · Updated 5 years ago
- The curation repository for the data behind Concepticon. ☆39 · Updated last week
- Official releases of the PROIEL treebank of ancient Indo-European languages ☆37 · Updated 2 years ago
- Open morphology for Finnish ☆91 · Updated 2 months ago
- A trend viewer written in Python/JavaScript ☆21 · Updated 8 months ago
- CRF-based Morphological Tagging and Lemmatization ☆37 · Updated 5 years ago
- Morphological analyzer and lemmatizer for Latin. ☆27 · Updated 5 months ago
- A Java UIMA-based toolbox for multilingual and efficient terminology extraction and multilingual term alignment ☆40 · Updated 7 years ago
- The CMU Link Grammar natural language parser ☆397 · Updated 3 months ago
- SCOWL (and friends). ☆432 · Updated this week
- A NoSketch Engine Docker image which is easy to use ☆19 · Updated last month
- Bilingual sentence aligner (Gale & Church, 1993) ☆14 · Updated 6 years ago
- Fast corpus search engine originally made for the Corpus of Written Tatar language ☆17 · Updated 5 years ago
- Automatically exported from code.google.com/p/guess-language ☆53 · Updated last year
- Machine translation for the real world ☆23 · Updated 5 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index ☆43 · Updated 8 months ago
- FreeLing project source code ☆257 · Updated 2 years ago
- A cloud-based, open-source system for writing and publishing dictionaries. ☆93 · Updated last year
- eXternally configurable REference and Non Named Entity Recognizer ☆17 · Updated last year
- LingPy: Python library for quantitative tasks in historical linguistics ☆136 · Updated 4 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 · Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆65 · Updated last year
- Repository for ru-syntax command line tool. ☆16 · Updated 3 years ago
- Machine-readable Wiktionary ☆76 · Updated last year
- Linguistica 5: Unsupervised Learning of Linguistic Structure ☆30 · Updated 6 years ago
- An LL parser for extracting information from Wiki text, particularly Wiktionary. ☆49 · Updated last year
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with… ☆75 · Updated last month