EisenVault / install-tesseract-redhat-centosLinks
Script for downloading and installing Tesseract OCR Engine on RedHat and CentOS
☆52Updated 7 years ago
Alternatives and similar repositories for install-tesseract-redhat-centos
Users that are interested in install-tesseract-redhat-centos are comparing it to the libraries listed below
Sorting:
- ☆16Updated 8 years ago
- A high performance similariy search service with faiss inside.☆27Updated 7 years ago
- Pre-Recognize Library - library with algorithms for improving OCR quality.☆104Updated 2 years ago
- SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex☆19Updated 2 years ago
- Page Segmentation Code. I'm working with OCRopus and the UW-III data set to test how the page segmentation algorithms work with smaller s…☆20Updated 12 years ago
- detect the table image in pdf or other format image by opencv and python .☆54Updated 5 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆42Updated 12 years ago
- Scripts and results from our OCR roundup, available on Source☆150Updated 6 years ago
- Implementation of Vision Based Page Segmentation algorithm in Java☆102Updated 5 years ago
- A library for efficient similarity search and clustering of dense vectors. It's a Go wrapper of faiss (https://github.com/facebookresearc…☆24Updated 2 years ago
- Perceptual Hash project for Videos (MMAI Term Project)☆27Updated 11 years ago
- Solr AutoComplete implementation☆59Updated 7 years ago
- Automatically exported from code.google.com/p/fire-cbir☆31Updated 10 years ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector☆252Updated 7 years ago
- Optical table recognition - recognize tables in scan images using OpenCV☆112Updated 5 years ago
- Training/test data for Dragnet☆41Updated 10 years ago
- Information Extraction System can perform NLP tasks like Named Entity Recognition, Sentence Simplification, Relation Extraction etc.☆27Updated 11 years ago
- Fast Word Segmentation with Triangular Matrix☆81Updated 3 years ago
- Document image binarization for Project 3A @Mines_Nancy☆30Updated 7 years ago
- This provides tools for b-bit MinHash algorism.☆36Updated 2 weeks ago
- Elasticsearch plugin for b-bit minhash algorism☆63Updated 11 months ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Repository collecting all the submodules for the new PyTorch-based OCR System.☆142Updated 4 years ago
- liberate all kinds of data from PDF and other unstructural format and make the information machine-readable and visualizeable for popul…☆31Updated 7 years ago
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- Detecting near duplicates usign Moses Charikars Algorithm☆20Updated 10 years ago
- Detect and fix skew in images containing text☆265Updated 6 years ago
- Rotation and skew detection using DL.☆59Updated 7 years ago
- "结巴"中文分词的C++版本,使用 darts Double Array Trie 降低内存占用到 1/100☆50Updated 2 years ago
- Java text categorization system☆56Updated 8 years ago