LogicalSpark / docker-tikaserver
Apache Tika Server as a Docker Image
☆171Updated 2 years ago
Alternatives and similar repositories for docker-tikaserver:
Users that are interested in docker-tikaserver are comparing it to the libraries listed below
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking☆137Updated 10 months ago
- Bulk indexing command line tool for elasticsearch.☆280Updated 3 months ago
- A bundle of useful Elasticsearch plugins☆110Updated 9 months ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector☆251Updated 7 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 9 years ago
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆100Updated last month
- "Stop worrying about Elasticsearch analyzers", my therapist says☆155Updated 3 years ago
- Mapper Attachments Type plugin for Elasticsearch☆504Updated last year
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆270Updated 2 years ago
- Elasticsearch entity resolution plugin based on Duke☆210Updated 4 years ago
- Entity resolution for Elasticsearch.☆158Updated this week
- Text classification using Naive Bayes and Elasticsearch☆154Updated 8 years ago
- a pure javascript frontend for ElasticSearch search indices.☆79Updated 6 years ago
- Carrot2 plugin for ElasticSearch☆292Updated 2 years ago
- Naive Bayes Classifier implemented with Elasticsearch Aggregations☆51Updated 10 years ago
- A reference mechanism for including content from other documents during the Elasticsearch analysis field mapping phase☆35Updated 5 years ago
- A python library detect and extract listing data from HTML page.☆109Updated 7 years ago
- A text tagger based on Lucene / Solr, using FST technology☆176Updated last year
- Tesseract 4 OCR Runtime Environment - Docker Container☆98Updated 5 years ago
- spaCy REST API, wrapped in a Docker container.☆266Updated 2 years ago
- A scrapy pipeline which send items to Elastic Search server☆327Updated 2 years ago
- An RDF plugin for Solr☆114Updated 3 years ago
- NER toolkit for HTML data☆257Updated 8 months ago
- Curated synonym files and Helpers for Elasticsearch Synonym Token Filter☆64Updated last year
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- Decompounding Plugin for Elasticsearch☆87Updated 3 years ago
- Towards an open source stack for e-commerce search☆145Updated last month
- Demonstration of using Python to process the Common Crawl dataset with the mrjob framework☆166Updated 2 years ago
- Elasticsearch lemmatizer for 15 languages☆104Updated last month
- Model Training tool for MITIE☆79Updated 9 years ago