larsmans / lucene-stanford-lemmatizerLinks
A library that adds some NLP capabilities to the Lucene search engine
☆50Updated 12 years ago
Alternatives and similar repositories for lucene-stanford-lemmatizer
Users that are interested in lucene-stanford-lemmatizer are comparing it to the libraries listed below
Sorting:
- SIREn - Semi-Structured Information Retrieval Engine☆108Updated 4 years ago
- SKOS Support for Apache Lucene and Solr☆56Updated 4 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆27Updated 6 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆159Updated 2 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Java text categorization system☆57Updated 8 years ago
- A text tagger based on Lucene / Solr, using FST technology☆177Updated last year
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆149Updated 3 years ago
- Analysis plugin for ElasticSearch providing capability for processing inline annotations in documents.☆35Updated 11 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- simple simhashing in hadoop with cascading☆33Updated 14 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆283Updated 7 years ago
- KEA 5.0 (keyphrase extraction software), modified to be an XML-RPC service☆42Updated 14 years ago
- Taming Text Book Source Code☆382Updated last year
- Keeps a mirror of DBpedia live in sync☆26Updated 3 years ago
- NLP tools developed by Emory University.☆61Updated 9 years ago
- Uncharted Ensemble Clustering is a flexible multi-threaded clustering library for rapidly constructing tailored clustering solutions that…☆32Updated 10 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- SKOS analysis for Elasticsearch☆54Updated 9 years ago
- Warcbase is an open-source platform for managing analyzing web archives☆162Updated 7 years ago
- Mirror of Apache Stanbol (incubating)☆114Updated last year
- ElasticSearch OSEM☆22Updated last year
- ElasticSearch Prediction Generator and Plugin☆22Updated 9 years ago
- Stream-based InputFormat for processing the compressed XML dumps of Wikipedia with Hadoop☆85Updated 12 years ago
- Twitter Tools☆220Updated 7 years ago
- Large RDF hierarchies as vector spaces☆20Updated 11 years ago
- Lightweight, multilingual natural language processing☆63Updated 12 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 8 years ago