EisenVault / install-tesseract-redhat-centosLinks
Script for downloading and installing Tesseract OCR Engine on RedHat and CentOS
☆53Updated 7 years ago
Alternatives and similar repositories for install-tesseract-redhat-centos
Users that are interested in install-tesseract-redhat-centos are comparing it to the libraries listed below
Sorting:
- Tesseract 4 OCR Compilation - Docker Container☆56Updated 3 years ago
- Repository collecting all the submodules for the new PyTorch-based OCR System.☆142Updated 4 years ago
- Some native scoring script plugins for elasticsearch☆29Updated 5 years ago
- A simple program to extract the text from an image before performing OCR☆222Updated 5 years ago
- (Java)A Method to Extract Tabular Content from PDF Files☆335Updated 2 years ago
- Detect and fix skew in images containing text☆268Updated 6 years ago
- Recognition Models for Kraken and CLSTM☆16Updated 6 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆402Updated last year
- Content Based Image Retrieval Plugin for Elasticsearch. It allows users to index images and search for similar images.☆408Updated 9 years ago
- An implementation of RESTful web service for tesseract-OCR using tornado☆136Updated 2 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆110Updated 2 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Text classification using Naive Bayes and Elasticsearch☆154Updated 9 years ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector☆252Updated 7 years ago
- This tool extracts word vectors from Lucene index.☆135Updated 7 years ago
- Integration between Stanford NLP and Apache Stanbol☆34Updated 9 years ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆274Updated 3 years ago
- Apache Tika Server as a Docker Image☆172Updated 3 years ago
- Go to: https://github.com/alexklibisz/elastiknn☆250Updated 5 years ago
- perspective correction for document image☆20Updated 12 years ago
- Tesseract 4 OCR Runtime Environment - Docker Container☆101Updated 6 years ago
- OCR evaluation brought to you by University of Alicante☆66Updated 3 years ago
- Detect text with stroke width transform.☆333Updated 9 years ago
- detect the table image in pdf or other format image by opencv and python .☆54Updated 6 years ago
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆104Updated 3 weeks ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆44Updated 12 years ago
- Mapping photos of Old New York☆293Updated 11 months ago
- 🖺 OCR using tensorflow with attention☆646Updated 6 years ago
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM…☆191Updated 3 years ago
- Optical table recognition - recognize tables in scan images using OpenCV☆112Updated 6 years ago