morfologik / polimorfologik
Scripts for preprocessing morfologik data.
☆40 Updated 8 years ago
Alternatives and similar repositories for polimorfologik
Users that are interested in polimorfologik are comparing it to the libraries listed below
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary. ☆197 Updated 2 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW… ☆72 Updated last year
- Some convenient natural language tools that build on NLTK. ☆85 Updated 11 years ago
- Basic dataset for the linguistic data collection. ☆15 Updated 8 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆12 Updated 2 years ago
- Deutsch Language Tool Kit ☆12 Updated 10 years ago
- Fast and robust NLP components implemented in Java. ☆53 Updated 5 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups ☆19 Updated this week
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆66 Updated last week
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. ☆21 Updated 8 years ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,… ☆79 Updated last week
- Finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆41 Updated 3 years ago
- Models for POS tagging and sentence and token detection with OpenNLP tools for the Italian language ☆52 Updated 12 years ago
- NameTag: Named Entity Tagger ☆37 Updated last year
- NER tagger for English, Spanish, Dutch, Italian, German, and French. ☆35 Updated 10 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆13 Updated 2 years ago
- WordNet-LMF formats ☆24 Updated last month
- Small Java library for splitting German compound words ☆63 Updated last year
- Semanticizest: dump parser and client ☆20 Updated 9 years ago
- GermaNER: Free Open German Named Entity Recognition Tool ☆36 Updated 2 years ago
- A wrapper, a lemmatizer and REST API implemented in Python for emMorph (Humor) Hungarian morphological analyzer ☆11 Updated 4 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 Updated 7 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆70 Updated last week
- Thot toolkit for statistical machine translation ☆53 Updated 3 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format) ☆65 Updated 9 years ago
- Machine translation for the real world ☆23 Updated 5 years ago
- A text tagger based on Lucene / Solr, using FST technology ☆177 Updated 2 years ago
- A parser and autocorrection tool for Wiktionary. ☆39 Updated 10 years ago
- Software and resources for natural language processing. ☆132 Updated 9 years ago