morfologik / polimorfologik
Scripts for preprocessing morfologik data.
☆40, updated 7 years ago
Alternatives and similar repositories for polimorfologik
Users who are interested in polimorfologik are comparing it to the libraries listed below.
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary. ☆193, updated last year
- A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not … ☆15, updated 5 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81, updated 6 years ago
- Polish morphological tagger. ☆43, updated 2 years ago
- ☆18, updated 9 years ago
- The Zurich Dependency Parser for German ☆85, updated 2 years ago
- Basic dataset for the linguistic data collection. ☆15, updated 8 years ago
- Elasticsearch lemmatizer for 15 languages ☆106, updated 6 months ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. ☆21, updated 8 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW… ☆72, updated last year
- Resources for doing NLP in Polish ☆47, updated 5 years ago
- MorphoDiTa: Morphologic Dictionary and Tagger ☆73, updated last year
- Slovak support for Elastic Search (with Dockerfile) ☆18, updated 5 years ago
- KEA - Keyphrase Extraction Algorithm ☆23, updated 9 years ago
- Python lemmatizer for Polish. ☆18, updated 5 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups ☆17, updated this week
- Python port of Stempel, an algorithmic stemmer for Polish language. ☆38, updated 9 months ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache … ☆26, updated 2 years ago
- TiMBL implements several memory-based learning algorithms. ☆52, updated this week
- Unitex/GramLab Language Resources ☆19, updated 2 years ago
- Standalone versions of LUCENE_5205 and other patches: SpanQueryParser, Concordance and Co-occurrence stats ☆18, updated 3 years ago
- small Java library for splitting German compound words ☆63, updated last year
- A text tagger based on Lucene / Solr, using FST technology ☆176, updated last year
- Unitex/GramLab C++ Core ☆23, updated last year
- Python port for IWNLP.Lemmatizer ☆17, updated last year
- A trend viewer written in Python/JavaScript ☆21, updated 7 months ago
- Abydos NLP/IR library for Python ☆186, updated 2 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg… ☆127, updated 6 months ago
- Baseform lemmatization for Elasticsearch ☆26, updated 6 years ago
- Search back-end for dependency tree search. See the docs at https://fginter.github.io/dep_search/ ☆17, updated 7 years ago