LogicalSpark / docker-tikaserver
Apache Tika Server as a Docker Image
☆172Updated 2 years ago
Alternatives and similar repositories for docker-tikaserver
Users that are interested in docker-tikaserver are comparing it to the libraries listed below
Sorting:
- A bundle of useful Elasticsearch plugins☆110Updated last year
- FacetView is a pure javascript frontend for ElasticSearch.☆290Updated 10 years ago
- spaCy REST API, wrapped in a Docker container.☆267Updated 2 years ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆272Updated 2 years ago
- Elasticsearch entity resolution plugin based on Duke☆210Updated 4 years ago
- Mapper Attachments Type plugin for Elasticsearch☆504Updated last year
- ☆184Updated 6 years ago
- a pure javascript frontend for ElasticSearch search indices.☆79Updated 7 years ago
- Entity resolution for Elasticsearch.☆159Updated 4 months ago
- Tesseract 4 OCR Runtime Environment - Docker Container☆99Updated 6 years ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector☆252Updated 7 years ago
- Bulk indexing command line tool for elasticsearch.☆281Updated 2 months ago
- Text classification using Naive Bayes and Elasticsearch☆154Updated 8 years ago
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆103Updated last week
- Working with hOCR in Javascript☆127Updated 2 years ago
- SKOS analysis for Elasticsearch☆54Updated 8 years ago
- Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms☆131Updated 9 years ago
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking☆137Updated last year
- 💫 REST microservices for various spaCy-related tasks☆240Updated 2 years ago
- Carrot2 plugin for ElasticSearch☆291Updated 2 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- Naive Bayes Classifier implemented with Elasticsearch Aggregations☆51Updated 11 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says☆154Updated 3 years ago
- Dockerfile to run unoconv as a webservice☆96Updated 2 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Model Training tool for MITIE☆79Updated 9 years ago
- Run your own OCR-as-a-Service using Tesseract and Docker☆1,364Updated last year
- Web based JavaScript GUI library for proofreading/editing hOCR☆95Updated 6 years ago
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- Elasticsearch plugin offering Neo4j integration for Personalized Search☆155Updated 4 years ago