morfologik / morfologik-stemming
Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.
☆188 Updated last year
Alternatives and similar repositories for morfologik-stemming:
Users who are interested in morfologik-stemming are comparing it to the libraries listed below.
- Scripts for preprocessing morfologik data. ☆39 Updated 7 years ago
- Query preprocessor for Java-based search engines (Querqy Core and Solr implementation) ☆183 Updated last week
- A text tagger based on Lucene / Solr, using FST technology ☆176 Updated last year
- Various utilities regarding Levenshtein transducers. (Java) ☆57 Updated 3 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 Updated 6 years ago
- Dice Solr Plugins from Simon Hughes Dice.com ☆87 Updated 3 years ago
- Search Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures ☆180 Updated 9 months ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP ☆270 Updated 2 years ago
- Elasticsearch lemmatizer for 15 languages ☆104 Updated last month
- Word2Vec Java Port ☆187 Updated 6 years ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm ☆66 Updated 4 years ago
- Machine learning components for Apache UIMA ☆129 Updated last year
- This tool extracts word vectors from Lucene index. ☆134 Updated 7 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector ☆160 Updated 4 years ago
- Solr Query Segmenter for structuring unstructured queries ☆21 Updated 3 years ago
- Hardened Fork of Ranklib learning to rank library ☆44 Updated 2 years ago
- small Java library for splitting German compound words ☆61 Updated 8 months ago
- Browser-driven explorer for lucene indexes ☆74 Updated 3 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework. ☆196 Updated 2 months ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format. ☆127 Updated 10 months ago
- ☆184 Updated 6 years ago
- Ingest processor doing language detection for fields ☆71 Updated 2 years ago
- Dice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon H… ☆28 Updated 5 years ago
- A Java implementation of the Rapid Automatic Keyword Extraction Framework ( RAKE ) ☆29 Updated 6 years ago
- Java implementation of the TextRank algorithm by Mihalcea, et al. ☆75 Updated 3 years ago
- Polish morphological tagger. ☆42 Updated last year
- Named Entity Recognition data for Europeana Newspapers ☆171 Updated last year
- NLP framework for JVM languages. ☆148 Updated 3 years ago
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking ☆137 Updated 10 months ago
- Java text categorization system ☆55 Updated 7 years ago