EisenVault / install-tesseract-redhat-centos
Script for downloading and installing Tesseract OCR Engine on RedHat and CentOS
☆52Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for install-tesseract-redhat-centos
- ☆55Updated 5 years ago
- Implementation of Vision Based Page Segmentation algorithm in Java☆101Updated 5 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆41Updated 11 years ago
- Page Segmentation Code. I'm working with OCRopus and the UW-III data set to test how the page segmentation algorithms work with smaller s…☆20Updated 11 years ago
- Elasticsearch plugin for b-bit minhash algorism☆62Updated 4 months ago
- This provides tools for b-bit MinHash algorism.☆35Updated 10 months ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago
- Extract meaningful content from pdf and psd file, such as texts and images both linked into a common JSON string☆36Updated 6 years ago
- Automatic Table reader. Can extract table data from images.☆15Updated 5 years ago
- Text retrieval database based on simhash similarity search☆24Updated last year
- Solr Redis Extensions☆52Updated 9 months ago
- Document image binarization for Project 3A @Mines_Nancy☆30Updated 7 years ago
- Starter Reverse Proxy Configuration for Solr☆47Updated 9 years ago
- Tesseract 4 OCR Compilation - Docker Container☆53Updated 2 years ago
- Recognition Models for Kraken and CLSTM☆13Updated 5 years ago
- Bachelor Thesis | Text extraction from complex video scenes☆15Updated 5 years ago
- Java text categorization system☆54Updated 7 years ago
- Using OpenCV to detect and correct skew in image of text documents.☆19Updated 10 years ago
- detect the table image in pdf or other format image by opencv and python .☆53Updated 5 years ago
- Simple RESTful API server running your own machine translation model. Docker image modified from mbartoli/easy-smt☆11Updated 5 years ago
- an idiomatic port of FlashText.py to Java using streams☆14Updated last month
- An expandable and scalable OCR pipeline☆86Updated 7 years ago
- Web Content Extraction Through Machine Learning☆185Updated 10 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆130Updated last year
- Java JNA Wrapper for Leptonica Image Processing Library☆27Updated 2 weeks ago
- Similarity hashing☆48Updated 13 years ago
- Document Layout Analysis Projects☆23Updated 5 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆101Updated last year
- An implementation of RESTful web service for tesseract-OCR using tornado☆135Updated last year