LogicalSpark / docker-tikaserverLinks
Apache Tika Server as a Docker Image
☆172Updated 2 years ago
Alternatives and similar repositories for docker-tikaserver
Users that are interested in docker-tikaserver are comparing it to the libraries listed below
Sorting:
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 10 years ago
- Bulk indexing command line tool for elasticsearch.☆281Updated 3 months ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector☆252Updated 7 years ago
- Elasticsearch entity resolution plugin based on Duke☆209Updated 5 years ago
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking☆137Updated last year
- A bundle of useful Elasticsearch plugins☆111Updated last year
- a pure javascript frontend for ElasticSearch search indices.☆80Updated 7 years ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆272Updated 2 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says☆154Updated 4 years ago
- Convenience Docker images for Apache Tika Server☆191Updated last week
- Mapper Attachments Type plugin for Elasticsearch☆503Updated 2 years ago
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆103Updated last month
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- Dockerfile to run unoconv as a webservice☆96Updated 2 years ago
- Entity resolution for Elasticsearch.☆160Updated 5 months ago
- Index URLs in Common Crawl☆194Updated 7 years ago
- NER toolkit for HTML data☆259Updated last year
- SOLR bulk indexing utility for the command line.☆44Updated 2 months ago
- IMAP and POP3 email importer for Elasticsearch (no river anymore)☆100Updated 3 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Updated 3 years ago
- Solr AutoComplete implementation☆59Updated 7 years ago
- 💫 REST microservices for various spaCy-related tasks☆240Updated 3 years ago
- ☆91Updated 3 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data.☆49Updated 8 years ago
- Text classification using Naive Bayes and Elasticsearch☆154Updated 8 years ago
- Ingest processor doing language detection for fields☆72Updated 2 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 8 years ago
- An RDF plugin for Solr☆115Updated 5 months ago
- A text tagger based on Lucene / Solr, using FST technology☆176Updated last year
- Export SOLR documents efficiently with cursors.☆38Updated 2 months ago