LogicalSpark / docker-tikaserver
Apache Tika Server as a Docker Image
☆172 Updated 3 years ago
Alternatives and similar repositories for docker-tikaserver
Users who are interested in docker-tikaserver are comparing it to the repositories listed below.
- A bundle of useful Elasticsearch plugins ☆112 Updated last year
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector ☆251 Updated 8 years ago
- FacetView is a pure javascript frontend for ElasticSearch. ☆291 Updated 10 years ago
- Mapper Attachments Type plugin for Elasticsearch ☆504 Updated 2 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says ☆154 Updated 4 years ago
- spaCy REST API, wrapped in a Docker container. ☆267 Updated 2 years ago
- a pure javascript frontend for ElasticSearch search indices. ☆80 Updated 7 years ago
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access… ☆104 Updated 3 weeks ago
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking ☆139 Updated last year
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP ☆275 Updated 3 years ago
- Carrot2 plugin for ElasticSearch ☆293 Updated 2 years ago
- Entity resolution for Elasticsearch. ☆164 Updated 2 months ago
- Elasticsearch entity resolution plugin based on Duke ☆209 Updated 5 years ago
- SOLR bulk indexing utility for the command line. ☆45 Updated last month
- Text classification using Naive Bayes and Elasticsearch ☆152 Updated 9 years ago
- Naive Bayes Classifier implemented with Elasticsearch Aggregations ☆51 Updated 11 years ago
- Bulk indexing command line tool for elasticsearch. ☆283 Updated 2 months ago
- 💫 REST microservices for various spaCy-related tasks ☆241 Updated 3 years ago
- A reference mechanism for including content from other documents during the Elasticsearch analysis field mapping phase ☆36 Updated 6 years ago
- Decompounding Plugin for Elasticsearch ☆87 Updated 4 years ago
- A Docker build for Solr, to manage the official Docker hub solr image ☆447 Updated 3 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition ☆136 Updated 9 years ago
- Tesseract 4 OCR Runtime Environment - Docker Container ☆101 Updated 6 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆404 Updated last year
- ☆92 Updated 3 years ago
- ☆185 Updated 7 years ago
- Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & N… ☆275 Updated 3 years ago
- Rich browser-based frontend for elasticsearch ☆102 Updated 10 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆73 Updated 8 years ago
- An expandable and scalable OCR pipeline ☆89 Updated 8 years ago