EisenVault / install-tesseract-redhat-centos
Script for downloading and installing Tesseract OCR Engine on RedHat and CentOS
☆53 Updated 7 years ago
Alternatives and similar repositories for install-tesseract-redhat-centos
Users that are interested in install-tesseract-redhat-centos are comparing it to the libraries listed below
- (Java) A Method to Extract Tabular Content from PDF Files ☆335 Updated 2 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig… ☆43 Updated 12 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition ☆136 Updated 9 years ago
- Java JNA Wrapper for Leptonica Image Processing Library ☆30 Updated this week
- Pdf2Dom is a PDF parser that converts the documents to an HTML DOM representation. The obtained DOM tree may be then serialized to a HTM… ☆186 Updated 2 years ago
- Detect and fix skew in images containing text ☆267 Updated 6 years ago
- Integration between Stanford NLP and Apache Stanbol ☆34 Updated 9 years ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts ☆79 Updated 3 years ago
- This provides tools for the b-bit MinHash algorithm. ☆36 Updated 2 months ago
- Apache Tika Server as a Docker Image ☆172 Updated 3 years ago
- ☆16 Updated 8 years ago
- Tesseract 4 OCR Compilation - Docker Container ☆54 Updated 3 years ago
- Intent recognition with OpenNLP ☆156 Updated 7 years ago
- Implementation of Vision Based Page Segmentation algorithm in Java ☆102 Updated 5 years ago
- Java interface for fastText ☆239 Updated 2 years ago
- Elasticsearch Index Termlist ☆117 Updated 6 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆395 Updated 11 months ago
- Fork of Program AB, the reference implementation of the AIML 2.0 draft specification. AIML is a widely adopted standard for creating chat… ☆60 Updated 8 years ago
- This tool extracts word vectors from Lucene index. ☆135 Updated 7 years ago
- A language detection library for the JVM ☆36 Updated last year
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆142 Updated 4 years ago
- Java text categorization system ☆56 Updated 8 years ago
- OCR evaluation brought to you by University of Alicante ☆68 Updated 2 years ago
- Python API for Various DB-Backed Simhash Clusters ☆64 Updated 8 years ago
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access… ☆102 Updated last week
- NLP tools developed by Emory University. ☆60 Updated 8 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality. ☆107 Updated 2 years ago
- Java/JNI bindings to libpostal for fast international street address parsing/normalization ☆125 Updated 3 weeks ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP ☆272 Updated 2 years ago
- Line segmentation algorithm for Google Vision API. ☆96 Updated 2 years ago