morfologik / morfologik-stemming
Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.
☆188Updated last year
Related projects ⓘ
Alternatives and complementary repositories for morfologik-stemming
- Scripts for preprocessing morfologik data.☆39Updated 6 years ago
- Query preprocessor for Java-based search engines (Querqy Core and Solr implementation)☆183Updated this week
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆160Updated 4 years ago
- A text tagger based on Lucene / Solr, using FST technology☆176Updated 11 months ago
- Elasticsearch Index Termlist☆117Updated 5 years ago
- Carrot2 plugin for ElasticSearch☆292Updated last year
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆269Updated 2 years ago
- Morfologik Polish Lemmatizer plugin for Elasticsearch☆85Updated last week
- "Stop worrying about Elasticsearch analyzers", my therapist says☆154Updated 3 years ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector☆251Updated 6 years ago
- Elasticsearch lemmatizer for 15 languages☆104Updated 5 months ago
- Automatically exported from code.google.com/p/universal-pos-tags☆128Updated 2 years ago
- Language Detection Library for Java☆569Updated 2 years ago
- Browser-driven explorer for lucene indexes☆74Updated 3 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆196Updated this week
- Named Entity Recognition data for Europeana Newspapers☆173Updated last year
- Elasticsearch term position similarity plugin☆70Updated last year
- Elasticsearch entity resolution plugin based on Duke☆210Updated 4 years ago
- Example of Now Deprecated Native Script Plugin for Elasticsearch☆131Updated 7 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆80Updated 6 years ago
- Python port of Stempel, an algorithmic stemmer for Polish language.☆35Updated 2 months ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆126Updated 8 months ago
- Java text categorization system☆54Updated 7 years ago
- Ingest processor doing language detection for fields☆71Updated 2 years ago
- Stemmer for German☆45Updated 2 years ago
- Apache Joshua☆105Updated 4 years ago
- Dice Solr Plugins from Simon Hughes Dice.com☆87Updated 3 years ago
- Hardened Fork of Ranklib learning to rank library☆44Updated 2 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆70Updated 7 months ago
- A high performance "thin wrapper" HTTP REST server on top of Apache Lucene☆137Updated 6 months ago