LogicalSpark / docker-tikaserver
Apache Tika Server as a Docker Image
☆172 · Updated 3 years ago
Alternatives and similar repositories for docker-tikaserver
Users that are interested in docker-tikaserver are comparing it to the libraries listed below
- A bundle of useful Elasticsearch plugins ☆112 · Updated last year
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector ☆252 · Updated 7 years ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP ☆274 · Updated 3 years ago
- a pure javascript frontend for ElasticSearch search indices. ☆80 · Updated 7 years ago
- FacetView is a pure javascript frontend for ElasticSearch. ☆291 · Updated 10 years ago
- spaCy REST API, wrapped in a Docker container. ☆267 · Updated 2 years ago
- Mapper Attachments Type plugin for Elasticsearch ☆505 · Updated 2 years ago
- "Stop worrying about Elasticsearch analyzers", my therapist says ☆154 · Updated 4 years ago
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking ☆139 · Updated last year
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access… ☆104 · Updated last month
- Elasticsearch entity resolution plugin based on Duke ☆209 · Updated 5 years ago
- Carrot2 plugin for ElasticSearch ☆291 · Updated 2 years ago
- Text classification using Naive Bayes and Elasticsearch ☆152 · Updated 9 years ago
- Index URLs in Common Crawl ☆197 · Updated 8 years ago
- A Python library to detect and extract listing data from HTML pages. ☆108 · Updated 8 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition ☆136 · Updated 9 years ago
- Bulk indexing command line tool for elasticsearch. ☆282 · Updated last month
- Naive Bayes Classifier implemented with Elasticsearch Aggregations ☆51 · Updated 11 years ago
- ☆185 · Updated 7 years ago
- Decompounding Plugin for Elasticsearch ☆87 · Updated 4 years ago
- Entity resolution for Elasticsearch. ☆163 · Updated last month
- 💫 REST microservices for various spaCy-related tasks ☆241 · Updated 3 years ago
- IMAP and POP3 email importer for Elasticsearch (no river anymore) ☆102 · Updated 3 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data. ☆49 · Updated 8 years ago
- SOLR bulk indexing utility for the command line. ☆45 · Updated last week
- Automatically exported from code.google.com/p/chromium-compact-language-detector ☆161 · Updated 5 years ago
- Analyze standard numbers like ARK, DOI, EAN, GTIN, IBAN, ISAN, ISBN, ISMN, ISNI, ISSN, ISTC, ISWC, ORCID, PPN, SICI, UPC, ZDB with Elasti… ☆24 · Updated 9 years ago
- LA-PDFText is a system for extracting accurate text from PDF-based research articles (and an interface to be able to improve performance … ☆81 · Updated 7 years ago
- Elasticsearch Index Termlist ☆118 · Updated 6 years ago
- Evaluating the performance and accuracy of ABBYY FineReader's OCR on Senate Financial Disclosure scanned forms ☆134 · Updated 9 years ago