morfologik / morfologik-stemmingLinks
Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.
☆192Updated last year
Alternatives and similar repositories for morfologik-stemming
Users that are interested in morfologik-stemming are comparing it to the libraries listed below
Sorting:
- Scripts for preprocessing morfologik data.☆40Updated 7 years ago
- A text tagger based on Lucene / Solr, using FST technology☆176Updated last year
- Browser-driven explorer for lucene indexes☆74Updated 3 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆57Updated 3 years ago
- Query preprocessor for Java-based search engines (Querqy Core and Solr implementation)☆184Updated this week
- Named Entity Recognition data for Europeana Newspapers☆171Updated 2 years ago
- Program used to split text into segments☆26Updated 7 months ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 6 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆128Updated last year
- small Java library for splitting German compound words☆63Updated last year
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆198Updated 6 months ago
- NLP framework for JVM languages.☆148Updated 4 years ago
- Elasticsearch lemmatizer for 15 languages☆106Updated 5 months ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆67Updated 4 years ago
- Machine learning components for Apache UIMA☆129Updated last year
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆72Updated last year
- This tool extracts word vectors from Lucene index.☆135Updated 7 years ago
- This provides tools for b-bit MinHash algorism.☆36Updated last week
- Elasticsearch entity resolution plugin based on Duke☆210Updated 5 years ago
- MorphoDiTa: Morphologic Dictionary and Tagger☆73Updated last year
- Elasticsearch Index Termlist☆117Updated 6 years ago
- Vector Plugin for Solr: calculate dot product / cosine similarity on documents☆14Updated 6 years ago
- Querqy for Elasticsearch☆46Updated last month
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache …☆26Updated 2 years ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆272Updated 2 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says☆154Updated 4 years ago
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat…☆86Updated last week
- Solr Query Segmenter for structuring unstructured queries☆21Updated 4 years ago
- Carrot2 plugin for ElasticSearch☆291Updated 2 years ago
- command line tool for Apache Lucene☆162Updated 2 months ago